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Preface 


This book is an introduction to detectors in particle physics. It is primar- 


ily aimed at students, who want to understand the instruments that pro- However, we do hope that even more 
. . . 3 experienced colleagues will find one or 
duce the data on which their research is based. This book should also be the other topic interesting and presented 
useful for physicists with little prior knowledge of detector physics who with a new twist. 
want to work on detector development, testing, commissioning or calibra- 
tion/alignment. 
We will assume that the reader has a general knowledge of physics 

at the level of an undergraduate degree (classical mechanics and electro- We consider the breadth of physics 

3 . f Sifi : employed in the development of particle 
magnetism, quantum mechanics, special relativity, thermal physics and detectors one of the appealing aspects 


condensed matter physics). An introductory level knowledge of electronics 9f this field. 
will also be useful for understanding some of the discussion in the chapter 
on electronic signals and noise. 

This book is intended to bridge the gap between the terse description 
in summaries like the PDG [505] and much more detailed and voluminous 
monographs written in the field. It strives to go beyond the simple phe- 
nomenological equations that are often used for back-of-the-envelope cal- 
culations and that have become part of the folklore of the field. It wants to 
give the reader an introduction to the physical models that are used by the 
very powerful simulation programmes that have been developed to pre- 
dict and model the response of particle detectors, but in which the physical 
models are concealed from the casual user. 

This book is divided into three main parts. Chapters 2 to 5 introduce 
the physics and electronics concepts that are underlying the operation of 
all particle physics detectors. These concepts will be required to under- 
stand the workings of detectors as discussed in the latter parts of the book. 
To give the reader an idea of the origin of the equations presented and the 
underlying physics, we do provide derivations of the results in these chap- 
ters. Some of these derivations are fairly advanced, and their details are 
not essential for the required understanding. However, they are included 
to give the interested reader an opportunity and a starting point to obtain 
a deeper understanding of their origin, if needed. Chapter 2 will discuss 
the interactions of incoming particles of various types with the detector 
material, and the resulting response. Most modern particle detectors will 
produce electrical signals, and in chapter 3 we will explore their genera- 
tion, and the effects of electronic noise in the signal acquisition process. 
Electrical signals rely on the movement of charges in electric and mag- 
netic fields, and we will investigate this for different detector materials in 
chapter 4. Another signal can be scintillation light from excitation of the 
detector material, and this will be the topic of chapter 5. 

Chapters 6 to 9 discuss how these principles lead to a signal from the 
detector for different types of detector materials. This part starts in chap- 
ter 6 with a look at emulsion and bubble chambers, which are largely of 
historical interest but have some modern applications, and then discusses 
detectors based on gaseous, liquid and solid detector materials, each in 
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their own chapter (chapters 7, 8 and 9). The reader wanting to know about 
some but not all of these techniques could just read the relevant chapters. 

The remaining chapters then discuss how these signals can be used 
to build experiments for particle physics, in increasing size and complex- 
ity. Chapter 10 explains how gaseous and semiconductor detectors can be 
combined with magnetic fields to determine the trajectories and momenta 
of charged particles and to reconstruct interaction vertices. In chapter 11 
we will investigate the measurement of the energy deposited by an in- 
coming electron, photon or hadron, when it is fully absorbed by a block 
of detector material (a calorimeter). Chapter 12 explains how different Again, the reader who only wishes to 

. i : . A . learn only about specific measurement 
techniques can be used to identify the type of the incoming particle. Chap- techniques can just read the relevant 
ter 13 will give a brief introduction to how in particle physics experiments —“#4Pter 
the recording of events can be instigated by a trigger, with a rate of data- 
taking compatible with the limitations of data processing and storage. Fi- 
nally, in chapter 14 we will investigate how the technologies introduced 
throughout this book can be combined into powerful detectors that pro- 
vide us with data for sophisticated physics studies and what some of the 
concerns and constraints are that drive the designs of these experiments. 

In addition to a thorough introduction to the physics of particle in- 
teractions with (detector) matter and a description of the most common 
types of particle detectors, this book also includes introductions to prac- 
tically important aspects of particle detectors, like electronics, alignment 
and calibration, and simulation of particle detectors. Where suitable it will 
demonstrate all these principles with examples of detectors that have been 
built, whether only at a prototype stage, or for actual particle physics ex- 
periments, although it is not aspiring to present a complete list of all types 
of particle detectors ever built or proposed. 

At the end of each chapter there is a summary of key concepts that 
have been discussed in the chapter and a set of exercises that allow the 
reader to investigate aspects of the chapter in more detail. 

We would like to thank our colleagues Wade Allison, Christoph 
Amelung, Giles Barr, Hugo Beauchemin, Steve Biller, Pawel Bruck- 
man de Renstrom, Paula Collins, Louis Fayard, Alfredo Ferrari, Neville 
Harnew, Peter Jenni, Malcolm John, Hans Kraus, Paul Lecoq, Michel 
Levebre, Tim Martin, Steve McMahon, Michael Moll, Peter Phillips, 
Meinhard Regler, Armin Reichold, Martin Tat, Rob Veenhof, Dave Wark, 
Morgan Wascko, Norbert Wermes and Steve Worm, who gave us valuable 
inputs to this book. Any remaining mistakes are ours. We wish to thank 
Rebecca Hodges-Davies from Taylor and Francis for suggesting we em- 
bark on this project and Danny Kielty also from Taylor and Francis for all 
his help and patience throughout the process of writing this book. 


Introduction 


Detectors in particle physics detect subatomic particles. The size of par- 
ticle physics experiments can range from 10s of um to 100s of km, al- 
though typically the size of the detectors will be at the lower end of this 
range, with typical experiment sizes of up to a few metres. This has two 
important consequences: First, particle detectors are significantly larger 
than the Compton wavelengths (h/mc) of the particles they need to detect. 
It is therefore entirely justified to designate them as ‘particle’ detectors, 
and it is usually sufficient for the understanding of the detector to treat the 
particles as quasi-classical particles. 

The second important consequence is that most of the subatomic par- 
ticles cannot be directly detected, because they just don’t live long enough 
to allow for a direct observable interaction with detector matter. Even 
with a significant boost, only particles with a lifetime of about 107! s 
and above can be detected directly, and most types of subatomic particles 
decay faster, in particular all particles that have strong or electromagnetic 
decay channels available that satisfy kinematics and conservation laws. 
This leaves only a limited number of particles that directly will account 
for signals in a particle detector. 


in standard experiments in high granularity detectors 


Leptons e, L, Ve, Vu, Vr T 
Mesons m*, K*, KL 

Baryons p,n AO E>, 2°, OF 
Bosons Y 


Other sub-atomic particles that can interact with a detector are bound 
states of the two most stable baryons, the proton and the neutron (for ex- 
ample deuterons or as, but also other nuclei). Finally, some mesons con- 
taining c and b quarks could potentially be directly observed in high gran- 
ularity detectors, although there are no modern experiments doing this 
systematically. 

While charged particles can leave detectable signals already in thin 
layers of matter (for example in a few 10s of um of silicon), neutral parti- 
cles do often require more substantial amounts of detector material. For 
the weakly interacting neutrinos, the interaction cross-sections are so 
small that the probability for detection of any one particle is very low and 
therefore very massive detectors are needed. In cases where the properties 
of a specific neutrino need to be measured (for example in collider exper- 
iments), the presence and kinematic properties of these particles can only 
be determined from an imbalance of the combined 4-momentum of all 
detected particles in the event. 

All the other subatomic particles can only be observed indirectly, 
through their decay products, and their existence can often only be in- 
ferred from a resonance in the invariant mass reconstructed from the four- 
momenta of their daughter particles. 

The fundamental particles that are precluded from direct observa- 
tion by their basic nature are quarks and gluons, because of confinement. 
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The question how a quantum mechani- 
cal physics process results in a straight 
particle track in a detector has been 
addressed by Mott in a famous pa- 

per [381]. 

Direct detection here means observation 
of a signal generated in the detector by 
the particle itself. 


Table 1.1: Sub-atomic particles with 

a sufficiently long lifetime for inter- 
action with a detector (anti-particles 
are implied, where applicable). High 
granularity detectors are detectors with 
a high density of 3D detection elements, 
like emulsions or bubble chambers. 


2 


However, the hadronisation of these particles produces jets. Because of 
their abundance in collider experiments (in particular at hadron colliders), 
detectors are needed that are capable of measuring the kinematic proper- 
ties of such jets. This is often done for the jet as a whole, as the individual 
components are difficult to disentangle. 


1.1 PROPERTIES TO BE MEASURED 


There are cases when the only information required from the detector is 
evidence for the existence of particles, for example when the amount of 
ionising radiation needs to be measured (the classical example for the use 
of a Geiger-Miiller particle counter). Then there are cases where the lo- 
cation of incoming particles needs to be observed (for example, photons 
in digital imaging) or their energy (for example, if the spectrum of ener- 
gies from a y source is to be measured). However, in particle physics it 

is usually the complete 4-momentum of individual particles that needs to 
be determined because of the need of full kinematic reconstruction of an 
event in the experiment. 

The 4-momentum of each particle can be determined by measuring 
any combination of its 3-momentum (magnitude and direction), and the 
energy or its mass. The latter can be inferred if one identifies the type of 
particle, either by observing a process or experimental signature only pos- 
sible for that particle type, or by other means, for example by combining a 
measurement of the speed with a measurement of the momentum. 

The direction of the 3-momentum can be found from the measurement 
of at least two spacepoints along the particle’s trajectory in 3D space. The 
momentum of charged particles can be inferred from the curvature of the 
particle’s track in a magnetic field, which requires the measurement of at 
least three positions of the particle along its trajectory. In both cases, the 
space points must be separated enough to allow for a large enough lever 
arm to make a good measurement, and the resolution of the spatial mea- 
surement will directly affect the measurement of the kinematic variables. 
Any disturbance of the particle’s trajectory before and between these mea- 
surements will also deteriorate the direction or momentum measurement. 

The energy is typically measured by completely absorbing the particle 
in a sufficiently large block of matter and recording the deposited energy. 

In addition to these measurements of the kinematic properties, particle 
physics experiments often require the rejection of backgrounds and, in 
cases where the raw event rate exceeds the capability of the readout to 
store data, fast selection of events to be recorded, which is typically based 
on correlations of signals in different detector elements. This selection is 
the task of triggers. 


1.2 EXPERIMENTS IN PARTICLE PHYSICS 


Particle physics experiments today are large and/or complex instruments, 
often comprising several different detector systems to provide improved 
coverage of kinematic parameters, sensitivity to different initial and final 
state particles, redundancy, identification of backgrounds, rate normalisa- 
tion, and/or capability to cross-check and calibrate systematic effects. 
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Jets are bundles of closely spaced 
parallel particles resulting from the 
hadronisation of a parton. 


See chapter 12. 


See chapter 10. 


See chapter 11. 


See chapter 13. 


This section is not intended as a com- 
plete list of particle physics experiments 
but gives an overview of the require- 
ments and environments that drive the 
design choices of particle detectors. 
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At the same time, the large range of physics studies in which the de- 
tectors are employed result in very different requirements and environ- 
ments. 

Arguably, one of the most demanding environments for particle de- 
tectors are the experiments at the Large Hadron Collider (LHC). There, 
very high collision rates put high demands on speed, segmentation, read- 
out bandwidth, event selection and radiation hardness, and the energies of 
particles to be detected cover a very large range from ~ GeV to 100s of 
GeV. The broad range of physics targeted by these experiments requires 
sensitivity to and discrimination of a wide range of particles. 

Since the start of operation in 2008, the LHC has been undergoing 
continuing improvements, leading to ever higher collision rates. The ex- 
periments have pursued a concurrent set of upgrades. These upgrades di- 
vide the operation of the experiments into periods that are separated by 
long shutdown (LS) periods. 


Table 1.2: LHC operation periods for pp collisions. Parameters for periods after 2023 are intended. Two LS are planned during phase 2, but not 
shown in the table. 


Run 1 LS1 Run 2 LS2 Run 3 LS3 Phase 2 (HL-LHC) 

Period 2009-2013 2015-2018 2022-2025 2027-2041 
Vs [TeV] 8 n 13 g 13.6 5 14 
Bunch spacing [ns] 50 S 25 S 25 Q 25 
Peak luminosity [em~2s~!] 8 x 103 a 2 x 1034 ei 2x 10% D 1035 

— = qQ 

© > S 
Average miraber of , 21 a 34 z 55 = 140-200 
collisions per bunch crossing 
Integrated luminosity [fb7 !] ~ 30 ~ 140 ~ 380 3000 


At the opposite end of the rate requirements are experiments searching 
for rare events. Here, good background rejection, shielding from cosmic 
backgrounds and radiopurity, and large detector masses are needed. Parti- 
cle energies in these experiments are generally low (K1 MeV). 

Experiments in neutrino physics also require a large target mass and 
good background suppression. A major task is identifying the flavour of 
the neutrinos. Experiments studying solar neutrinos or anti-neutrinos from 
reactors must be sensitive to low energies (O(MeV)) while atmospheric 
neutrinos have energies in the range of ~100 MeV to a few 10s of GeV. 
Very high energy neutrinos can be detected by instrumenting large vol- 
umes of ice or water. Neutrinos from accelerators have energies in the ee eae aie 
range from about 100 MeV to about 10 GeV. 

Finally, particle detectors are also employed in space, where additional 
challenges are posed by the environment and the launch. However, the 
same particle detector technologies as for terrestrial experiments are being 
used. 

In chapter 14, we will come back to these discussions and demonstrate 
how all these demands are met by the detectors in modern particle physics 
experiments. 


Interactions of particles 
with matter 


The biggest challenge for the detection of subatomic particles is clearly 
the small amount of energy these particles carry. A TeV particle has about 
1077 J of energy and the momentum is about 107!5 kgm/s, so that direct 
mechanical observation is not possible. 
However, subatomic particles do carry the charges of the electroweak 
or the strong interaction, and with these charges they couple to similar 
charges in the detector material and transfer some or all of their energy 
to the detector. This energy transfer results in excitation or ionisation of Ultimately, the energy will be trans- 
i R . : ferred to phonons, which in special 
atoms in the detector, which in modern particle detectors are used to create ‘Circumstances can be detected (see 
an electrical signal large enough that it can be amplified and recorded by section 11.2). 
electronic means. 
Ultimately, the signal creation in particle detectors always involves 
electromagnetic interactions. We will therefore start with a discussion of 
how photons interact with matter. 


2.1 INTERACTION OF PHOTONS WITH MATTER 


Due to the nature of the couplings in QED any interaction of an incoming 
photon with the charged fermions in the detector medium will result in 
the destruction of the photon. It is thus impossible to follow the path of a T? maintain four-momentum conserva- 
: . i . . tion this is sometimes associated with 
photon in a detector. The path can only be inferred if the starting point of the creation of another photon, but that 
the photon can be deduced by other means. photon does generally not retain the 
y . . E direction of the former. 
Depending on the photon energy, there are three major interaction 


mechanisms. 


Yr a on R ‘eel T ae ee ae Figure 2.1: Cross-sections for the 
i Carbon (Z = 6) H $ PN Lead (Z = 82) 4 interaction of photons for a low-Z and a 
o - experimental Otot 4 yo o — experimental Otot high-Z element (modified from [379]). 
\ | In these plots we have omitted coherent 
= scattering, photonuclear reactions and 
pair production in the electron fields. 
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of photons in matter is the mean 

a free path, which (in cm) is given by 
Pair 4 A = 1.66ma/(P©), where p is the mass 
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Photon Energy Photon Energy for both the atomic molar mass and the 
dimensionless atomic mass number. We 
use m, and A for the atomic molar mass 


PHOTOELECTRIC EFFECT (typically in g/mol), and the atomic 


mass number, respectively. 
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At low energies (up to about 1 MeV) the dominant interaction of pho- 
tons with matter is photoelectric effect. The photon gets absorbed by 

an atom and its energy frees an electron, which escapes with an energy 

E = Ey— Ebina, where Eping is the binding energy of the electron. At very 
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low energies this equation constitutes a threshold. Below the threshold en- 
ergy for photoelectric effect the material can be transparent, if there are no 
molecular resonances. This energy range is called the ‘optical’ region. 

For a single atom the binding energy of the interacting electron equals 
the ionisation energy. For solids the relevant parameter is the ‘photoelec- 
tric work function’, which is defined as the minimum energy required to 
remove an electron from the solid to infinity. In the solid, the states of 
outer electrons are arranged in bands, which are occupied up to the Fermi 
level —€p. In addition, we also need to consider surface effects. A simple 
model for the surface effect in metals comprises a uniform layer of surface 
dipoles (often called a ‘double layer’). To penetrate this double layer an 
additional energy Ws, depending on the surface properties, is required, so 
that the work function W = —€f + W, [73]. 

In metals, the highest energy electrons will be in the conduction band, 
where they are less well bound than in a single atom, and thus the work 
function is typically 30-50% lower than the ionisation energy for a single 
atom, typically in the range of 3 to 5 eV. The lowest work functions are 
observed for alkalis (for example, caesium with a work function of about 
1.8 eV). 

For photon detection in semiconductors it is in principle sufficient to 
lift the electrons across the band gap between the valence and the conduc- 
tion band, which then allows movement of the electron (and the positive 
hole left in the valence band) towards collecting electrodes. The small 
energy required for this means that semiconductor sensors are sensitive 
down to the infrared range of energies. For silicon the band gap is about 
1.12 eV, which allows interaction of photons with wavelengths up to about 
1100 nm. 

In solid metals and semi-conductors collective oscillations of the free 
electron gas density in the valence band (‘plasmons’) can occur at defined 
energies. Photons with this energy can be absorbed without the emission 
of a photoelectron, and their energy used in the excitation of a plasmon. 
Typical plasmon energies are between 10 and 30 eV. 

At energies well above the ionisation threshold the photoelectric cross- 
section of compounds is to good approximation given by the sum of the 
cross-sections of the constituent atoms. 

The cross-section for photon absorption in single atoms can be calcu- 
lated using time-dependent perturbation theory. In the following we will 
demonstrate this for the case of hydrogen. In the case of a weak field, the 
transition of the momentum operator, p > p— eA(?, t), with e the charge 
of the electron, and the vector potential describing the interacting field 

A(?,t) =Ao(@) £ iE, (2.1) 
can be introduced as a perturbation ÔH (t) = (—ihe/m)A(?,t) - V. In 
first order time-dependent perturbation theory the transition rate from the 
atomic ground state to a state n can be found from Fermi’s Golden Rule 


20 


ln = > 
h 


|(Pn| 5H1(t)|%o)|” p, (2.2) 
with the corresponding (hydrogen) wavefunctions Yo and ¥,, and p the 


density of final states. The cross-section for the transition is then found to 


In interactions with molecules at low 
energies, the energy of the photon can 
be used up in excitations. This type of 
photon absorption can occur at energies 
below the ionisation threshold, i.e. 
without the emission of an electron. 


The minus sign here indicates that the 
Fermi level is lower than the energy of 
an electron at infinity, i.e. electrons at 
the Fermi level are still bound. 


The work function can thus depend on 
surface properties like crystal orienta- 
tion but also contamination. 


A/pm = 1.24/(E/eV). Visible light is in 
the range from 1.75 (red) to 3.1 eV (vio- 
let). 


However, silicon is an indirect band 
gap semiconductor, which means that 
to conserve energy and momentum 
additional phonons are required, and 
the transition will be suppressed at 
energies below 3.4 eV. This will be 
discussed in more detail in section 9.8. 


For the detailed derivation see for 
example [162]. 


This is an example of “minimal substi- 
tution” and the fundamental justifica- 
tion comes from demanding U(1) gauge 
invariance [270]. 


È describes the polarisation of the field. 


Note that Fermi’s Golden Rule also 
implies energy conservation between the 
initial and final states. This is sometimes 
explicitly expressed using 5-functions. 
To improve readability we will omit 
these here, but energy conservation 
should be assumed throughout this 
chapter. This also removes the time- 
dependent term in the exponential in 

eq. (2.1). 
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On = —z— (Prle E VY), (2.3) 
mM Wn 
with @, = (En — Eo) /ħ. 


For low energy photons, the wavelength is much larger than the size of 
the atom, and thus we can approximate the exponential in the matrix with 
1+ik-7. In this approximation, the matrix element becomes 


= MOn 


(Plet. VIY) > ; 


(Pn lE- FIP). 


This yields, assuming random polarisation of the photon, for the integral 
cross-section 


T X On 


4 2 
Jondo, = “30 


It is common to introduce the ‘dipole oscillator strengths’ 
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(2.4) 


The cross-section for a transition to the state n can then be written as 


2n ah 
J 0, d@, = Ía- 


In the photoelectric effect the electron does not transition to a bound 
state, but escapes with positive energy. We assume that the electron is 
non-relativistic and its energy therefore (ke)? / (2m). For sufficiently 
high energies, the interaction of the electron with the nucleus can be 
neglected, and the emitted electron be described by a plane wave, 

Y, =V~!/2 exp(ik, : F). The matrix element then becomes 


—ike Te krz œ -A 43 
E- VPo(r) d?r. 
=a fe 


The cross-section for a transition to the continuum can then be obtained 


from me, 
4r ah 
0(@) = TEL | Mel? p(o)ae. 
m?o 


The transition to the continuum is characterised by an ‘oscillator 
strength density’ 


Moo = (Peji 2 (2.5) 


Z | iMaPol 26) 
~ mo A l 
The oscillator strengths am a sum rule, 
df 
—da=1 25) 
I do OT >? E 


where the oscillator strength density formally includes the discrete transi- 
tions by addition of f ô(@, — @). 

For many-electron atoms the position operator in the matrix elements 
needs to be replaced, 7 > Fij Ti, and the sum rule (eq. 2.7) adds up to Z 
instead of 1. 
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(o) = Vm? Jho _ 
d T2 V2 


& is the fine structure constant, 
1 æ 
~ Amey hic’ 
This is referred to as the ‘dipole approx- 
imation’. 


Vmp 
Qn hr 


For random polarisation 


eas 1 = 
Pnlé Fo)? = 5 (LaF Bo). 


This equation is known as the Thomas- 
Reiche-Kuhn (TRK) sum rule. For a 
detailed discussion of sum rules see for 
example [133]. 
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The matrix element in eq. (2.5), and thus the cross-section for photo- 
electric effect drops for increasing energy (with decreasing wavelength), 
as the wavelength of the incoming photon gets smaller compared to the 
dimension of the atom, so that in the integration of the matrix element 
the periods of the electric field average out. For higher energies, the shell 
structure of the atom becomes visible with an absorption edge for each 
new energetically accessible shell, with the K shell the innermost and 
highest energy shell. At any given energy electrons are emitted predom- 
inantly from the lowest accessible shell. Subsequently, the atoms will re- 
turn to the ground state in a cascade of de-excitation steps, including the 
emission of fluorescence photons and Auger electrons. 

Because of the complexity of shell and shielding effects, there is 
no closed form description of the cross-section for photoelectric effect, 
but typically it scales with Z” with m between 3 and 5, and E,”, with n 
around 3. The strong decrease for the cross-section at higher energies (sev- 
eral 10 keV) is the reason that X-rays are highly penetrating and are useful 
as a diagnostic tool. 

At low energies (i@ < mec’) the direction of the photoelectron emis- 
sion aligns with the electric field in the incident electromagnetic wave, i.e. 
perpendicularly to the direction of the incoming photon. With increasing 
energy the emission becomes more and more forward peaked. 
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COMPTON SCATTERING 


In the intermediate energy range (about 1 MeV) Compton scattering is the 
dominant interaction. The cross-section for Compton scattering on a free 
electron can be derived from the Dirac equation and is given by the Klein- 
Nishina equation [310] 


ofS aan (ONC) [i ea eli +2) : 


mec? x2 1+2x x 
In(1+2x) 
2x 


14+3x 
(1+ 2x)? 


| , (2.8) 


with x = E,/(m,c”). For low energies the cross-section for scattering on 
free electrons becomes constant and equal to the cross-section for Thom- 


son scattering, 
th 82 / ahe a 
Oo —_— 0 = — 7 i 
x0 3 \ mec 


(2.9) 


The strong dependence on Z is the 
reason high-Z materials (e.g. lead) 
are preferred shielding materials for Y 
radiation. 


The rapid variation with Z means that 
X-rays are particular sensitive to heavy 
elements, e.g. calcium in bone. 


Figure 2.2: Left: Cross-section (scaled 
with E>) for photoabsorption for 
different noble gases (data from [359]). 
Right: Angular distribution of photo- 
electrons [389]. 


Compton scattering is therefore often 
the predominant interaction for ys from 
radioactive decays. 


y e 


e Y 


Figure 2.3: Tree level Feynman diagram 
for Compton scattering. 


Thomson scattering is the elastic scat- 
tering of photons by a free charged 
particle. It does not result in a change 
of frequency of the photon. It applies, as 
long as the wavelength of the photon is 
large compared to the Compton wave- 
length h/mc of the particle. Thomson 
scattering can be described by classical 
electromagnetism (see exercise 7). 
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For high energies the Klein-Nishina cross-section decreases and can be 
approximated by 


3 1 1 
oN a | ne 
x00 8 x 2 
For Compton scattering on electrons bound in atoms the cross-section 

at high energies (x > 1) follows the Klein-Nishina result, and the cross- 
section for Compton scattering on the atom is o = Zo¥N. For low ener- f 

: ae . Another elastic scattering process, 
gies, however, binding effects are important. If the photon does not dis- Rayleigh scattering, occurs at low 
lodge the bound electron it will scatter coherently, without energy loss. energies. This is elastic scattering of 

the photon from the atom as a whole. 


Thus the cross-section for Compton scattering decreases for low energies However, it is usually not relevant for 


once the photon energy becomes comparable to, or smaller, than the ener- P&rticle detectors as its cross-section 
is significantly smaller than the cross- 


gies of the bound electrons in the atom. section for photoelectric effect in this 
Photons are scattered in all directions at low energy, but are energy Tange, 
i i i i x his is a general feature of photon 
dominantly scattered in the forward direction when x becomes larger TI l l 
than 1 emission at relativistic energies (‘head- 


light’ or ‘searchlight’ effect). Emission 
becomes forward peaked with a char- 
acteristic angle 0 ~ y`! (see also 
exercise 4). 


120 60 


Figure 2.4: Polar angle distribution 

for the scattered photon in Compton 
scattering (the distribution is symmetric 
== x=1/100 = =x=0.2 "=!" x=1 ===x=3 ——x=10 in the azimuth). 


Ignoring binding effects, the scattered photon energy can be found 
from 4-momentum conservation, E},/Ey = [1 +x(1 — cos 0)]7}, where 0 
is the scattering angle for the photon. The kinetic energy of the scattered 
electron is then 


x(1—cos @) 
E= Ey ——————., 2.10 
"1 +x(1 —cos 6) mm) 
which has a maximum for head-on collisions of 
2x 
Be LaS Er (2.11) 


The scattered electron typically loses its energy to ionisation in the 
detector until it comes to a stop. The scattered photon continues, until it 
either has another interaction in the detector, or it escapes. 


PAIR PRODUCTION 


Once the energy of the photon exceeds 2mec?, it can create an elec- 


tron/positron pair. To maintain 4-momentum conservation it can only do e 
this in the electric field of a charged particle, which absorbs the recoil. Pair Y e 
production usually occurs in the field of a nucleus. It can also take place in j 
the field of a shell electron, but because of the smaller charge the atomic Y 
cross-section for this type of pair production is smaller by a factor Z. Ze 

Photon absorption due to pair production starts at threshold (2mec?) Figure 2): A tree level Penman 


. i . í A diagram for pair production. 
and increases with the photon energy. For high energies (i.e. full screen- 


ing) the absorption coefficient tends to a constant value 
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M=nmo0=-—, (2.12) 
where ną is the volume number density of nuclei, and Xo is called the ‘ra- 
diation length’, which we will investigate in more detail in the next sec- 
tion. 

In pair production, the excess energy above the energy required to cre- 
ate the electron-positron pair is carried away as kinetic energy by the two 
leptons. They in term lose their energy to bremsstrahlung or ionisation, 
until they come to a stop. At this point the positron will annihilate with an 
electron in the material, with the emission of two 511 keV photons. Those 
can be absorbed in the detector by photoelectric effect or Compton scatter- 
ing, or, if the detector is of limited size, escape the detector. 


BREMSSTRAHLUNG AND THE RADIATION LENGTH 


The radiation length Xo is defined for the closely related process of 
bremsstrahlung, in which an electron in the field of another charged parti- 
cle (e.g. a nucleus) emits photons. 

For large energies the energy loss of the electron per track length is 
proportional to the energy E of the incoming particle 


=- (2.13) 
and the radiation length is the distance at which the energy of the in- 
coming particle has dropped by a factor 1 /e due to the emission of 
bremsstrahlung. Again, the radiation length describes the radiation 

process at high energies, corresponding to the limit of full screening 

(E > (aZ'/3)—'me?). 

We can make a simple estimate of the radiation length using a semi- 
classical argument, based on the Weizsäcker-Williams model [498, 513]. 
In this approach, the electron in its frame sees the charge in the ma- 
terial (i.e. the nucleus) as accompanied by a cloud of virtual photons. 
These virtual photons scatter off the electron by Thomson scattering. By 
combining these two effects we can determine the photon spectrum for 
bremsstrahlung and hence estimate the radiation length. 

For a relativistic charge with Lorentz factor y, and an impact parame- 
ter b the electromagnetic fields are predominantly transverse with E ~ cB. 
These fields can be considered as due to virtual photons. The peak electric 
field, when the distance between nucleus and electron is smallest, is 


Ye 
| 
4meéqb2 


An observer at rest sees a field greater than half this value for a time 
At = b/(yc). The frequency spectrum of this pulse extends up to a max- 
imum frequency Omax % (At)~! = yc/b. The total energy in the pulse is 
U= E0EZV, where V is the volume, V ~ Tb? x b/y. Hence, 
2 

167 €ob 
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The factor 7/9 in this equation is purely 
due to definition of Xo, which origi- 
nates from the description of a related 
process, bremsstrahlung, as discussed 
below. 


Decay into two photons is required to 
conserve 4-momentum. 


Y 
Ze 


Figure 2.6: A tree level Feynman 
diagram for bremsstrahlung. 


As the name implies, the radiation 
length is a distance. As we will see 
below, it scales with pre In some 
situations it is useful to take out this de- 
pendence and use a ‘reduced’ radiation 
length Xj, = pXo, which has units of, for 
example, g/cm?. The length dimension 
can be restored by dividing by the mass 
density. 

It is common to use the same designa- 
tion, ‘radiation length’, and the symbol 
Xo for both properties. We do not sup- 
port this practice. 


See exercise 3. 


The y factor comes from the Lorentz 
transformation of the electric field in the 
direction transverse to the boost. 


The energy in the electric and magnetic 
fields is equal for electromagnetic waves 
in vacuum. 
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Assuming all this energy is radiated, its spectral density can be ap- 
proximated by 
dU(@) U è ah 
d® ~ Omax l6me&c 4° 


The number of photons of frequency @ is given by 


dN,(@) = a (2.14) 


A more accurate calculation for the virtual photon spectrum for an elec- 
tron with speed Bc gives [268] 
dNy(@) 2a 
doa xa 


(2.15) 


We can now combine this flux with the Thomson cross-section for 
scattering of low energy photons on electrons (eq. (2.9)). The number of 
interactions of a particle crossing a thickness dx is nodx, where o is the 
cross-section and na = Nap/ma is the volume number density of nuclei, 
and Na is Avogadro’s constant, p is the mass density and mą is the atomic 
molar mass. We can use this to combine eqns. (2.15) and (2.9) to calculate 
the energy loss 


@max dN. 
aE = | hon,odx — dw = 
0 dæ 


Omax se 2. 203 2 
=f T o NaP Za h Ses. 
m @ \ mech? 


We can integrate this up to the energy of the electron (E = @max) and 
re-write it as 
1 dE _16Na py (_h 5 
pE dx 3 m mecB2 ) ` 


Comparing this with the definition of the radiation length (eq. 2.13) we 
obtain an estimate for the radiation length, 


11 16 hoe Na oo. 
pXo  3p4m z c? m 


(2.16) 


The cross-sections for bremsstrahlung and pair production can be 
calculated from QED, but the derivation is complicated by the screen- 
ing of the nuclear charge by electrons in different shells. A measure of 
the extent of screening is given by the ability of the momentum transfer 
to resolve the charge structure of the atom, given by its approximate ra- 
dius aọZ!⁄. Considering the kinematics, full screening is achieved for 
Ey > (aZ!) !mec? [132]. 

The detailed calculation including shell effects yields [477, 505] 


11 Ka? Na eo 
=4 Z” [Laa — f(Z)| + ZL 2.17 
P Xo mac Ma { [ rad f( )] aI a ( ) 
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The virtual photons are thus predomi- 
nantly soft. 


As the photons are predominantly soft, 
their wavelengths are large compared 
to the size of the nucleus and the pho- 
tons are coherent, so that the matrix 
element for scattering scales with Z, 
and the cross-section can be obtained 
by multiplication with Z?. 


The radiation length is inversely pro- 
portional to the matrix element squared, 
where the matrix element M «Ze* for 
the interaction with the nuclei. For the 
interaction with the shell electrons, the 
contribution to the radiation length is 
«Z|M|*, with M«e. 
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where Lyaq and L/f q can be approximated for elements with Z > 4 by 
Lrad = In(184.15Z~"/3) and L/,, ~ In(1194Z~7/3), respectively. f (Z) is 
a small correction @ (az?). See [379] for tabulated values of Lraq and EA 
for Z < 4 and a parameterisation of f(Z). 

A simple approximation for the reduced radiation length in g/cm? is 
pXo = 180A /Z? (good to 20% for 13 < Z < 92). 

For inhomogeneous materials the effective radiation length can be 


found from i 


E oa 
n ex 


where the v; are the volume fractions and x the radiation lengths of the 
components, and the sum runs over the components of the material. Simi- 


larly, for compounds 
1 


mij 
Xo XE 
i “0 
where the m; are the mass fractions of the elements in the compound. 


INTERACTION OF PHOTONS WITH DIFFERENT MATERIALS 


As shown in the previous sections, the cross-sections for the different 
types of photon interactions with matter have a different dependence on 
the atomic number of the material. Consequently, Compton scattering 


dominates over a wide energy range for materials with a low Z, but is con- 


fined around 1 MeV for high-Z matter. 
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2.2 ELECTROMAGNETIC FIELDS AND MATTER 


So far, we have treated electromagnetic phenomena as particles, which 
is in the spirit of a book on particle detectors, and arguably justified 

in the case of higher energy photons. However, as we will see shortly, 
low-energy electromagnetic interactions underlie the energy transfer of 
charged particles to matter. These can, to a large extent, be described by 
electromagnetic fields, and we therefore will now briefly revise the be- 
haviour of such fields in the presence of matter. 
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Remember that the radiation length 

is the scale length for bremsstrahlung 
and pair production in the limit of high 
energies. 


At very high energies and material 
densities the cross-section for these 
processes are reduced, due to the Lan- 
dau—Pomeranchuk—Migdal (LPM) 
effect [311]. Practically this is only 
relevant for the detection of extremely 
high energy cosmic rays. 


Figure 2.7: Most probable interaction 
mechanism for photons with matter 
(data from [128]]). 
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Electromagnetic fields are described by Maxwell’s relations, which in 
matter take the form of 


ee OB (2.18) 
VxH=j+ 


with the electric displacement field D = eË and the magnetic flux density 
B= uw . The proportionality constants € and u are the permittivity and the 
permeability, respectively, which can be split as € = €9&, and U = Uolr, 
into the value for the vacuum and a relative factor describing the effects of 
the medium. 

Introduction of the magnetic vector potential A, and the electric scalar 
potential @, defined by 


=> = > => = oA 
B=V xA, and E V0 - 57> 


satisfies two of the Maxwell equations, but does not define the potentials 
unambiguously. Adoption of the Lorenz gauge 


= Oo 
V-A —= 
+ELU a 0 
results in 7 
=- O-A z 
-V° + ene = jj, 
ss (2.19) 
_vVv2 R ae 


which describe wave equations with source terms uj and p /€. 

A plane wave Ē(F,t) « exp|i(k-7— @t)] is a solution to these wave 
equations, with a phase velocity c = w/k = (Eolo) "2, which gets modi- 
fied by a factor 1/n = (€,u,)~!/? in a medium, with n the index of refrac- 
tion. Absorption of a wave in the medium can be described by an imag- 
inary part of the index of refraction, n = nı + in2. The photoabsorption 
cross-section can then be written as Oy = 2@n2/ (nec). Often the rela- 
tive permeability u, is close to 1, and thus the absorption is attributed to a 
complex permittivity € = £; +1€, and, for €; >> &2, 


Bg (2.20) 


where the last approximation is valid for low density media (i.e. gases), 


and thus = = len ae 
As noted above, generally the complex permittivity will be a function 
of the frequency, and we will refer to €.(@) = €1(@) + i€2(@) as the ‘di- 
electric function’. As we have seen in section 2.1, the cross-section for 
photon interactions does change quite dramatically at low energies due to 
shell effects on the photoionisation cross-section, and thus we also expect 
significant structure for the dielectric function. To understand the origin 
of the variations of the index of refraction, we can use a simple classical 


model that is actually reasonably applicable for ionic polarisations. In the 
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We assume that these parameters are 
linear, isotropic, homogeneous and time 
independent, but as we will see later, 
they will depend on the frequency of the 
oscillation of the fields (i.e. the material 
is dispersive). 


The characteristic length, with which 
the amplitude of the wave decays ex- 
ponentially, is aps = (1/2)Imk = 

(Ne Oy) | with ne the number density of 
scattering centres, here electrons, and 
0, the cross-section for scattering on 
one electron. 


The use of the letter “n” for both the 
index of refraction and the number 
density is unfortunate, but we follow 
common practice here. 


Interactions of particles with matter 


unperturbed state, the ions will be held in position by ionic bonds that can 
be described by a parabolic potential for not too large disturbances. An 
excursion of the relative position of the ions will thus result in oscillations 
with a frequency of @p. In the following we will assume a simple binary 
bond between two ions of charge e. In the presence of an external field 
E(t) = Eyexp[i(kr— ar)| the dynamics of the bond separation will be 
described by 


dr dr E" z ir- 
H +a + Wop? = —eF (t) = —ePpi t00, 


where u is the reduced mass and we also have added a damping term pro- 
portional to a damping constant I’. The solution of this equation is 


> e 1 ee ht 
F(t) = J a pe. 
U @ — @* —iol 
This result can be used to find the polarisation 
2 
= , ne > os 
P = -nef = Ege ™®™ 


where n is the density of bonds. But £, = (£&9E + P) /(€9E), and thus 


ne? 1 
EoL œf — o? -— ior 


&l@)=1+ 


This equation describes a resonance, which in Atomic Physics is re- 
ferred to as a Lorentzian. For ionic bonds typical resonance frequencies 
are in the infrared. 

For atomic electrons similar behaviour is observed, even though they 
cannot be treated as classical particles with fixed equilibrium positions and 
linear restoring forces within the atom. Instead, their state is described by 
a wavefunction, which is distorted quasi-elastically by weak perturbations. 
The distortion can only occur to another eigenstate of the atom (including 
highly excited states and even the continuum), with the frequency Wp now 
describing the energy difference of the transition. The total electronic po- 
larisation is given by the sum of the resulting resonance terms, weighted 
by an ‘oscillator strength density’ df /d@. The oscillator strength den- 
sity comprises transitions to discrete atomic states (which can be formally 
described as additive terms <f,,6(@— @,)), as well as states to the contin- 
uum. In solids discrete atomic levels rearrange into bands of states, which 
essentially become continuous. 

Using the oscillator strength density, the relative permittivity is given 
by 
Po 1 5 (0') 

PJ @?-— a? -ior 


where we have used the plasma frequency @,, defined by 


&,(@) = 14 a’. (2.21) 


2_ 4naħcne 


o = (2.22) 
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We assume that the rate of change of 
position dx/dt is small compared to 
the speed of light, and hence we ignore 
magnetic effects. 


In the dipole approximation the phase 
of the electric field at any time is effec- 
tively the same over the whole region 
occupied by the electron, and the spatial 
term in the exponential can be dropped. 


The most important contribution to 
damping are collisions, which result in 
the loss of energy stored in the vibra- 
tion. 


wlwg 


Figure 2.8: Real and imaginary index of 
refraction (T = @/10). 


This is equivalent to the oscillator 
strengths which we have introduced in 
eqs. (2.4) and (2.6). 


A useful dimensional approxima- 

tion for the plasma frequency in eV is 
28.8,/pz/A, where p is given in g/cm, 
and z is the effective number of free 
electrons per unit volume. 
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Again, the sum rule se dæ = | applies. 


Figure 2.9: The dispersion curve of a 
typical transparent medium over the 
electromagnetic spectrum. [292]. 
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The requirement to maintain causality between the polarisation and 
electric field leads to two important relations connecting the real and 


imaginary parts of the dielectric function (see for example [290]) rad the Kramers-Kronig relations 
2 o'a) — 
= f d P denotes the Cauchy principal value of 

EI (@) = ls P d , (2.23a) the integral. 


T o” = MA 
0 


da’, (2.23b) 


Comparison of eq. (2.21) with the Kramers-Kronig relation eq. (2.23a) 
suggests a relation 


—(@) = ——5&:(0). (2.24) 


This is confirmed by an alternative version of the TRK sum rule 
eq. (2.7) that can be found for €2 [504]. The integral in eq. (2.23a) can 
be split as 


Wc co 

2 ORSIO 2 ORSA O 

==P a dao! +? p/ AO) a 
o“ -— o 

0 


where @, is a cut-off frequency above the absorption region. For frequen- 
cies high above the absorption region, £; (@) > 1 — wo; / and €)(@) + 0. 
Thus the second integral goes to 0. If we now investigate this expression 
for @ >> @ we can ignore @’ in the denominator in the first integral and 


obtain 
2 Me 
Q, 2 į , , We don’t need the principal value 
T = a2 @ &(@ ) dæ > here any more because there is no 
@ TO 0 singularity. 


so that the sum rule becomes 


(2.25) 


T 
J osxodo =n oz. 
0 


But the Kramers-Kronig relations apply more generally for any com- 
plex analytical function. For example, 1/€(@) is analytical in the upper See for example [504]. 
half of the complex frequency plane, as €(@) is analytical and has no 
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zeros there. Consequently, 


co. 1 
1 2 o m(t) i 
re; =) =e ow da’, (2.26a) 
r 
F Re( =) -1 
1 2@ (z) ] 
Im (=) ee P PE — da’. (2.26b) 


Note that while appearing very similar, these equations are not directly 
derived from eq. (2.23). 
A similar derivation as before yields a corresponding sum rule 


F -1 T 2 
ol —— ]do = ~@-. 
l a(s) 2 


In principle the oscillator strengths can be calculated for a given ele- 
ment using time-dependent perturbation theory. However, in practice this 
is complicated, as it requires knowledge and processing of the electronic 
states of the atoms, including shell effects. A more pragmatic approach is 
to derive the dielectric function, and ultimately the oscillator strengths 
from the more conveniently available data of photon absorption using 
eqs. (2.20) and (2.23a). 


(2.27) 


2.3 CHARGED PARTICLE ENERGY LOSS 


For charged particles the primary means of interaction with the detector 
material is by electromagnetic interaction. Even though the detector ma- 
terial is on the whole electrically neutral, it provides plenty of electrical 
charges at the subatomic level. Of those, the most relevant for the gener- 
ation of a signal in a particle detector are electrons, which are relatively 
mobile and thus will readily respond to the stimulus of the passing parti- 
cle, and acquire energy from the incoming particle. 

It is instructive to look at this first using a classical approach, origi- 
nally developed by Bohr [152]. We start with modelling the interaction of 
the incoming particle of mass M and charge ze with a single electron of 
mass me, Where we make two assumptions: First, that the mass of the in- 
coming particle is high so that its motion is not disturbed by the collision, 
and second, that the electron is free to move and changes in its potential 
energy are negligible (it behaves as a free particle). The electron will then 
get a radial kick 


dt 2zæħ 
Ap = |Fdt=e /E,—dx= , 
4 I ej tak Bb’ 


where we have used Gauss’ law, fE | 2abdx = ze /£o. The energy gained 
by an electron at radius b is then 


_ Ap? 2R 


AE (b) =e 
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Im(1/€,(@)) is often referred to as ‘di- 
electric loss function’. This is a measure 
of the attenuation of an electromagnetic 
wave ina “lossy” dielectric. 


We will discuss the interaction with the 
nuclei in the material in section 2.10. 


AG Af 


Figure 2.10: Geometry used in the 
derivation. 


The effects of the longitudinal compo- 
nent cancel as the particle passes. 


This simple model is based on an elec- 
trostatic interaction only. A calculation 
using non-relativistic quantum mechan- 
ics is discussed in exercise 6. 


The energy transfer is independent of 
the mass of the incoming particle, and 
proportional to the inverse mass of the 
target particles. This is the reason en- 
ergy transfer to more massive particles 
in the material, i.e. nuclei, is negligible. 
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If the particle passes through a slab of matter of thickness dx, with an elec- 
tron density ne, the energy loss will be 


Anz 07hn, db 


dx. 
Mep? b 


dE = —AE (b)ne 2T7bdbdx = — 
The energy loss per path length (also called the ‘stopping power’) is then 
obtained from an integration over the distance b, 


dE 4ra? h ne ' bra 
= n 
dx me B* 


where we have introduced physical integration boundaries bmin and bax. 
These are given by the limits where the assumptions underlying this model 
break down. 

The lower boundary bmin reflects the fact that the maximum energy 
transfer is limited and occurs in a head-on collision. If we stay within 
the non-relativistic approach, then AEmax = 2mev* = AE (bmin) (for 
M © me), and the stopping power becomes constant for large veloci- 
ties. However, we can introduce some relativistic behaviour by using 
AE (Dmin) = 2Y?mev?. 

The upper boundary is given by the distance beyond which the interac- 
tion loses its ability to affect the electronic state. To estimate this, we as- 
sume that the electron is bound in an atomic orbit. If the interaction time, 
which can be estimated as f(b) = b(yv)~!, becomes longer than the or- 
bital period w~! of the electron, the influence on the electronic state will 
be adiabatic, and no energy will be transferred. 

The combined result of this classical derivation (with the relativistic 
cut-off energy) is thus 


(2.28) 


b min 


dE 4 2 272 2 3 
ey EL n(2 id | (2.30) 


dx meß? zacho 
There are several things to note: 


e The energy loss is independent of the mass of the incoming particle; 


e Itis proportional to the electron density of the detector material, 
otherwise it is only weakly depending on the detector material 
(through @); 


e As it originates in the interaction of the electrons in the material 
with the charged particle, it is proportional to z?@?; 


e For low velocities (By < 1) it is dominated by the falling 1/B* func- 
tion, whereas for large velocities it will increase slowly due to the y? 
dependence of the argument in the logarithm. 


A quantum mechanical treatment of the energy loss problem has been 
developed by Bethe [130]. This result is the basis for a widely used stan- 
dard equation for the energy loss of a charged particle in matter [505] 

mec? B? y AE max 


1 fdE\ _ 2Z 1/1 5(BY) 
a(=)> a (5m rP 


2 ? 
(2.31) 
where the constant K = ARNO? h /me ~ 0.3 MeVmol7!cm?, / is the 
mean excitation energy (on a logarithmic scale) and 6(B) a correction 


B? 
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Figure 2.11: Energy transfer as a func- 
tion of impact parameter (after [290]). 


The exact expression is 
2 2 
2y°mv~ 


ME mas = ne Tay 
+ 275% + (46) 


(2.29) 


The rising section is often referred 

to as the ‘relativistic rise’, although 
this terminology is misleading, as the 
increase is really a consequence of 
the increased transverse reach of the 
electromagnetic interaction, as we will 
see in the next section. It only becomes 
relevant at relativistic energies. 


LdE . ‘ ; 
pdx called the ‘mass stopping 


power’. 


This equation is often referred to as 
the Bethe-Bloch equation. Bloch’s 
contribution was a calculation of the 
mean excitation energy, which is not 
commonly used today, where data for 
this parameter exists (see below). 
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due to the density effect. The latter describes the reduction of the relativis- 


tic rise due to the polarisation of the detector material. 
For the energy loss measured in a detector another modification of 


the relativistic rise is observed when the maximum energy transfer is lim- 
ited to a value Eout < Emax. As will be discussed in section 2.5, the energy 


transferred in individual interactions with the detector material has very 


long tails. The limit on the transferred energy cuts these tails, resulting in 
a weaker rise, and the energy loss becomes constant at high momentum. In 


practice, this occurs in thin detectors, because of the low probability of a 
high energy scattered electron being absorbed. 


- p”(dE/dx) (MeV g~'cm2) 


In the Bethe equation the only parameter that depends on the material 
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is the mean excitation energy. While it is often treated as a parameter to 
fit the data, it does have a physical interpretation, and can in principle be 
calculated from [243] 


InI 


_ Jo In(i@)o,(@)d@ 2 


go Oy(@)do zo? 


fom (zy) In (fœ) do. 


For most elements it can be parameterised as J = ZIo, with Jọ around 10 
to 12 eV. As it enters only logarithmically into the Bethe equation this 
variation is not significant for many applications. 
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A model for the density effect has been 
developed by Sternheimer [466, 467], 
although its predictions are not very 
satisfactory. For a better treatment see 
the next section. 


It is common to use the term ‘energy 
loss’ and dE /dx for the energy mea- 
sured in a detector layer, although this 
is not strictly correct, and can lead to 
confusion. 


This is called the ‘Fermi Plateau’. 


Figure 2.12: Mass stopping power as a 
function of By (modified from [379]). 
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Figure 2.13: Left: Mean excitation energies divided by Z versus atomic number [379]. Right: Mass stopping power at minimum ionisation versus 
atomic number [505]. 
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An important feature of the stopping power described by the Bethe 
equation is the minimum, which occurs at By ~ 3-4 with a mass stopping 
power of about 1-2 MeVg~!cm?. We call a particle with such a velocity a 
‘minimum-ionising particle’ (MIP). 


2.4 DIELECTRIC THEORY OF CHARGED PARTICLE 
ENERGY LOSS 


So far, our treatment has only considered electrostatic effects, and hence 
has been basically non-relativistic. The complete relativistic treatment 
of the problem requires to solve Maxwell’s equations eq. (2.18) for the 
source terms describing the incoming charge with velocity B G; 


p =zed (7- Bet) „and j = Bop. 


In the Coulomb gauge, the 4D-Fourier components of the potentials are 
given by [46] 


> ze 


1/€,£ dia B 


On k? — £,£0 L00? 


al 


(2.32) 
ze 


MBC Sio- Èpo, 


On k? — £,£0 L00? 


anl 


AK, o) 


with the dielectric function £, = €, (k, @) = £1 (k,@) +i£2(k, œ). 
The electric field generated by the incoming particle can be found 
from 


= 1 oa ae ee 
E(?,t) = ea lioä&, o) - KoE, o) dk- kdo. 


The stopping power is given by the decelerating force of the electric 
field E (F,t) at the location of the incoming particle 


a 
B 
After insertion of eq. (2.32), and transformation of the wave number into 


spherical coordinates, we integrate over the angular components, and use 
€(k, —@) = €*(k, @) to obtain 


dE 


rid Fae = zeE (Bct,t) . 


m( ay) 


(2.33) 


— o’) Im l : 
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The first term in this expression is called the transverse term, as it 
stems from the magnetic vector potential term (in the Coulomb gauge), for 
which the electric field is transverse to the direction of the 3-momentum 
transfer, K. The second term originates in the scalar potential @ and is 
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We are transferring into (@,k) space, 
as we will soon discuss ways to model 
the dielectric function in this parameter 
space. 


These expressions are known as the 
Liénard—Wiechert potentials. 


Here and in the following we assume 
iwi, 


The magnetic field does not alter the 
energy of the incoming particle. 


The multiple use of the letter E is unfor- 
tunate, but unavoidable. We denote the 
electric field as a vector by E, and the 
energy of the incoming particle as E. 


The -function 8(@ — k- Bc) = 
(kbe)! 8(@ -— kBccos @) in spheri- 
cal coordinates. 


The lower integration boundary for the 
integration over k reflects the kinematics 
of the interaction. Conservation of 4- 
momentum for an incoming particle of 
mass M and velocity Bc when emitting a 
photon with energy h@ and momentum 
hk requires 


no(1 ) neB -k 


which for small energy and mo- 
mentum transfers (h@ < yMc? and 
hk < YMcB) becomes 

won 
which implies kmin = ©/(Bc). 
In practice there is also an upper inte- 
gration boundary Omax, Which is given 
by eq. (2.29). 


Re 
2yM’ 


ho 
2yMc? 
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often referred to as the longitudinal term, as it has the electric field paral- 
lel to the momentum transfer. It is the term relevant in the non-relativistic 
treatment. 

The only unknown in expression (2.33) is the dielectric function, 
€-(k,@). Unfortunately, this function is neither trivial, nor easily experi- 
mentally accessible. Its structure is determined by the quantum mechan- 
ical states of the atoms in the detector material. In addition, the phase 
space covered by the integral extends beyond the dispersion relation for 
real photons. However, simplified models, which are based on the exper- 
imentally accessible photo-absorption cross-section oy(@) have been de- 
veloped to extrapolate the dielectric function into this region. One of these 
models, which is widely used for the accurate calculation of energy loss 
in particle detectors is known as the Photo-Absorption Ionisation (PAT) 
model [47]. 

At this point we will examine eq. (2.33) from a new angle and inter- 
pret this equation as an integral over collisions involving the exchange of 
individual photons with energy E = h@ and momentum q = hk. Because 
of the double integral, these photons will generally not be on mass shell, 
i.e. they will be virtual. 

The imaginary part of the dielectric function, €)(@), for real pho- 
tons (with k = @/c) can be found from the measured photo-absorption 
cross-section using eq. (2.20), and the real part €;(@) from that using the 
Kramers-Kronig relation eq. (2.23a). To evaluate eq. (2.33), we have to 
extrapolate this over the (@,k) plane covered by the integrals in eq. (2.33). 
In the PAI model the dipole approximation is assumed to be still valid for 
small k (in the resonance region), so that we can use €-(k,@) ~ £ (@) 
there. We assume that this region extends to Whe < 2m-h@. However, the 
loss function Im(—1/€,(k, @)) must satisfy the sum rule [51] 


R —1 T 2 
olm| ——— } dø = +0 
[ n( a) 2 P 


for all k. We attribute the deficit to the absorption of the photon by station- 
ary (quasi-free) electrons, for which non-relativistic energy and momen- 
tum conservation yields fi@ = hk? / (2me), which can be expressed as a 
ô-function. This contribution can thus be written as Cô(@ — hk? / (2me)), 
and C can be determined using eq. (2.34). One finds 


1 i =I 1 
c=— fo Im| — ] da’, 
OP € (@') 


and thus the loss function can be expressed as 


(Ean) =e (ea (ew) * 


(2.34) 
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The probability density do /dE for a 
photon exchange with energy E can be 
found from 


dE = do 
2 =n, | “Ede. 
d nef dE 


Sometimes the differential cross-section 
is given per atom. This just gives a 
different scaling factor for the cross- 
section. All relevant results (energy loss 
etc.) are unaffected by this choice as 
they depend on n,(do/dE). 


Note that this equation is different than 
eq. (2.27), as € depends on @ and k. It 
is thus valid for any slice in k. 


We are ignoring the Fermi motion of 
bound electrons. 
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After integration with this approximation the longitudinal energy loss 
is found to be 


Jom( ; ) ae = (239) 
j € (@') 


Essentially the only velocity dependence this expression features is 
the overall 1 /B* dependence which dominates the behaviour of the energy 
loss for low-energy projectiles. In the relativistic region it remains effec- 
tively constant. The second term in this expression is non-zero even for 
frequencies at which the photoabsorption cross-section, and thus &(@), 
has fallen to zero, hence it is the dominating term at frequencies above the 
absorption region. 

For the transverse term the integrand is largest when the denominator 
is small, @ ~ kc/,./€,, i.e. for states close to the real photon dispersion re- 
lation, hence we simplify by replacing €,(k,@) with the dielectric function 
for real photons €,(@). The integration with this approximation over k can 
be done analytically and yields 


dE 
(=) ~ | d@@ x 
dx trans 


- | ? 1 , 2 
Ki E Vi- Peo) +p (e 


zah 


mB?c 


2 1—82 * 
with @ = 5 atan ( ieee | = arg(1— pe (@)). 
The transverse differential cross-section is then given by 


(22) za . 
Ne | — = 
dE trans mB7hce 


m = n ' i 
i (a5)! anon (P 


The first term in this expression is the source of the relativistic rise 
and its saturation: In the transparent approximation (£2 ~ 0) the argument 
of the logarithm becomes (1 — v?/u*)~!/2 = y with the photon phase 
velocity u = c /4/£1. The term depends on the density as the denominator 
in the logarithm becomes |1 — B7¢,| in this case, and for B — 1 tends to 
|1 — €,| ~ n, proportional to the density. 

The second term is the only contribution to the energy loss that will be 
relevant at energies below the absorption region, where the loss function 
vanishes. It is of special interest when the material becomes transparent 
(€2 — 0). In that case, © is close to 0, if B < 1/,/€, and jumps to close to 


€ (E) 
aid) ° 
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(E) = (loat Ce) sana 


We introduced this term to account 
for the absorption of the photon by 
stationary (quasi-free) electrons. The 
integral over @' is summing over those 
electrons for a given @. 


The integrals are standard, but the eval- 
uation is elaborate, without providing 
much physical insight. 


It includes the screening of the electric 
field due to the polarisation of the 
material caused by the passage of the 
charged particle. 


The relativistic rise is a few % for solids 
and liquids, but can be 50%—-70% in 
high-Z noble gases. 
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T above. Above this threshold the second term becomes 


do = 7a 1 1 - 2a 
dE) cy, neħc Be.) nehe 
where cos 6c, = (B \/é,)~!. This term describes the energy loss due to 


emission of real photons, which is called ‘Cherenkov radiation’, which 
will be discussed in more detail in section 2.8. 
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While the PAI model has been developed for the interaction with 
gases, it can also be used for solids, for example silicon [276]. 
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The Bethe equation (eq. (2.31)) is a useful description of the average 
energy loss for charged particles, offering a simple yet reasonable parame- 
terisation of the energy loss as a function of the projectile’s velocity. How- 
ever, the dielectric theory provides several benefits: 


e The relativistic rise including the density effect is properly de- 
scribed by the theory and does not need to be added a posteriori as a 
correction. 


e Energy loss due to Cherenkov radiation is an intrinsic aspect of the 
dielectric theory. It is ignored in the Bethe equation. 


e The Bethe equation describes the mean energy loss, without provid- 
ing a probability density for the energy transferred. The dielectric 
model interprets the energy loss as multiple exchanges of virtual 


(2.37) 
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Below this threshold this term is an 
indistinguishable contribution to the 
transverse energy loss. 


Figure 2.14: Calculated energy transfer 
differential cross-section for argon for 
By =4 (minimum-ionising, left) and 
By = 100 (on Fermi plateau, right) 
[141]. The different shaded areas are 
(from top to bottom) due to distant lon- 
gitudinal collisions (white - first term in 
eq. (2.35), transverse collisions, (light 
grey - eq. (2.36)), and close longitudinal 
collisions (dark grey - second term 

in eq. (2.35). The peaks correspond 

to interactions with M, L and K shell 
electrons (from lower to higher energy). 


Another model based on dielectric 
theory, but specifically optimised for 
silicon, has been developed by Bich- 
sel [139]. In practice, results from 
this model are very similar to the PAI 
results. 


Figure 2.15: Dielectric loss function of 
solid silicon as a function of the photon 
energy [141]. Note the presence of the 
plasmon peak (see section 2.1) at about 
17 eV, which is due to the resonant 
excitation of collective oscillations 

of electrons in the valence band. L23 
describes X-ray transitions from 2p to 
3d states. 


As we will see in section 2.8, Cherenkov 
energy loss is only a small fraction 

of the total charged particle energy 
loss, and can thus be ignored in many 
applications. 
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photons with small energy transfers, for which a probability density 
can be given. This density can then be used to calculate fluctuations 
in the energy deposition in material slabs of finite size. This is par- 

ticularly important for thin detector layers, and will be investigated 
in the next section. 


It can be shown that, by introducing a number of approximations, the 
PAI model can reproduce the Bethe equation (eq. (2.31)) [46], although 
there is little benefit in reproducing a less accurate description. 

An extension of the PAI model is the photoabsorption ionisation and 
relaxation (PAIR) model. In the original PAI model, it is assumed that 
all the energy in the transfer from the primary particle to the atoms is ab- 
sorbed and converted into ionisation at the point of interaction. The PAIR 
model takes into account the shell structure of the atom, and that the pho- 
toelectron carries the transferred energy minus the binding energy of the 
given shell. After the emission of the photoelectron the atom is left in an 
excited state with a vacancy in the ionised shell and relaxes via the emis- 
sion of fluorescent photons as well as Auger electrons. Technically, this is 
achieved by separating eq. (2.33) into contributions for the different shells 


and the subsequent simulation of atomic relaxations and 6-electrons [460]. 


Implementations of the PAI model (with improvements) are part of the 
GEANT detector simulation package [62], and of the PAIR model in the 
HEED software [460] (see also section 7.8). Figure 2.17 demonstrates the 
agreement of predictions using HEED with measured data. 
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Figure 2.16: Relativistic rise of the 
most probable ionisation energy loss 

in a layer of Ar/CH4 93/7 (STP) with a 
thickness of 6 cm (relative to minimum- 
ionising energy loss). Open circles are 
experimental data, closed circles are 
from a calculation according to the PAI 
model [62]. 


Figure 2.17: Comparison of the num- 
ber of primary energy transfers for 
minimum-ionising particles for various 
gases at NTP according to calculations 
by HEED (circles) and various experi- 
ments [460]. 


Interactions of particles with matter 


2.5 ENERGY LOSS FLUCTUATIONS - STRAGGLING 
DISTRIBUTIONS 


Once the differential cross-section do /dE is known, the mean free path A 
between collisions can be found from 


Emax q 
j =neo =ne | OO dE = Mo. (2.39) 


Enin dE 


As can be seen from Figure 2.14, the differential cross-section as a 
function of the energy has long tails. Consequently, in each collision a 
varying amount of energy is transferred. As individual clusters are uncor- 
related, the distance between collisions is exponentially distributed. In 
particular in gases the distance between collisions can be macroscopic, 
and can affect the reconstruction of the incoming track (see section 7.5). 

Usually, the energy loss within a detector layer is small compared to 
the energy of the incoming particle, and because individual collisions are 
uncorrelated, the probability distribution for the number of collisions k 
within a distance x is given by a Poisson distribution 


where the expectation value (k) =x/A = xMo. 

As a consequence of the statistical fluctuations of the energy transmis- 
sion the energy deposited in a finite slab of detector material varies from 
event to event. In a thick layer the distribution of the deposited energy will 
acquire a Gaussian shape, in accordance with the central limit theorem. 
For thin layers the distribution of energy loss for a large number of incom- 
ing particles with identical energy is skewed, with a long tail due to rare 
high-energy clusters. Such a distribution is called a ‘straggling distribu- 
tion’. 
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For thin absorbers the average (dE/dx) is not a very useful parame- 
ter for characterising energy loss spectra. The most probable value A, of 
the energy loss distribution is usually significantly smaller than the mean 
energy loss. 

The probability density for a particle to lose an amount of energy E in 
a single collision is given by 


(2.40) 
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The first moment of the differential 
cross-section is 


Emax 
Mi =n, | sa |e 


Emin 


Higher moments describe the scale 
of the energy loss fluctuation (‘strag- 
gling’). 


Each of these discrete energy transfers 
is referred to as a ‘cluster’. 


Figure 2.18: Straggling distribution. A): 
most probable energy loss, (A): energy 
loss from Bethe equation, Mo and M;: 
cumulative 0" moment (mean number 
of collisions) and 1* moment (mean 
energy loss) [505]. 
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In principle, the probability for an energy loss of A in k collisions can be 
found from a recurrent convolution 


A 
F(A,k) = [FP @-Ey OaE. (2.41) 
0 


The probability for an energy loss of A in k collisions is the convolution of 
the distribution to lose A— E in k— 1 collisions with the distribution to lose 
the remainder in one collision. However, we are usually interested in the 
energy loss distribution for a given detector thickness, not for a fixed num- 
ber of collisions. Different approaches to find this are being used [140]. 

An early analytic approach was developed by Landau [324], and later 
modified by Vavilov [485]. The computations involved are quite com- 
plicated and, due to the approximations used, generally the results of the 
Landau calculations do not predict real straggling distributions very well. 
However, the general shape of the solutions are widely used for parameter- 
ising measured distributions, which led to straggling distributions gener- 
ally being referred to (not very accurately) as “Landau distributions’. 

A better approach is again by convolution. We start with dividing the 
thickness of the detector into thin slabs of thickness 6x, for each of which 
the probability (k) = 6x/A for a collision is small (typically a few percent 
in practical computations). The probability for an energy loss of A in this 
slab is then 


where f (0) (A) is the probability for an energy loss of A without a colli- 
sion, which is obviously a delta function at A = 0. From this, the material 
thickness can be built up by convolution, using 


A 
Finite = [FQ-E.n)FE 2) 42. (2.43) 


The discrete nature of the onean loss and the associated cluster 
size distribution is particularly observable in gaseous detectors, where the 
lower interaction density along the track leads to a wide separation of the 
collision events. 


2.6 EFFECTS OF CHARGED PARTICLE ENERGY LOSS 


Usually, the virtual photon exchange between the incoming charged parti- 
cle and the detector matter will result in the change of state of an electron 
in the material. Depending on the transferred energy this electron can be 
lifted to a higher bound state or, in a solid, into the conduction band, or 

it can escape altogether as a free particle. We have seen that if the trans- 
ferred energy is sufficient, it will be predominantly inner electrons which 
will be affected. If the excited electron escapes with sufficient energy it 
can result in further excitations and ionisation of the detector material. 


Interactions of particles with matter 


At the same time the electron will leave a vacancy, which will be filled 
in relaxation processes that involve the emission of X-rays and/or Auger 
electrons. The latter will typically lose their energy quickly in further col- 
lisions with the detector matter, whereas the former, due to the longer ab- 
sorption lengths for photons in the X-ray regime, have a higher chance to 
escape the detector volume without further interaction. 

The result of all the response effects is typically a number of ionisation 
and excitation events for each collision. Both of these effects can be made 
use of in particle detectors. Free charges can be moved in electric fields 
towards readout electrodes. The resulting induced currents on the elec- 
trodes can be used to record an electronic signal. Excited states will decay 
with the emission of photons (‘scintillation’), and if the detector material 
is made transparent for these scintillation photons, they can be guided to 
dedicated photon detectors. 

Usually, the secondaries are created in close proximity to the original 
collision. In gases the distance between the collisions is larger, and the 
clusters will display macroscopic separation. 

In the extreme case, the interaction can result in the transfer of a size- 
able amount of energy in a single collision, which gets transmitted to a 
single electron that then travels for macroscopic distances in the detector 
material. Such an electron is called a ‘6-electron’. ĝ-electron emission 
with enough energy to be distinguishable from the original track is rare 
(in argon only 5 x 1074 of the ionisation electrons have an energy above 
10 keV [148]). Nevertheless, they can be an important contribution to the 
degradation of the location of the primary track. 

The average number of ionisation pairs created within a length L of 
detector material is given by 


(aL 


(Nip) = (W) , 


where the use of mean values reflects the fact that the number of ionisation 
pairs, like the energy loss, is subject to straggling, while W, the energy 
required to create the pair, will be subject to variations in the sharing of 
energy loss due to ionisation and excitation. 

In gases (W) does depend on the energy of the incoming particle, but 
reaches a constant value for energies above | keV. The value in the high- 
energy limit depends on the material, and is usually about twice as large as 
the ionisation potential Z. It is similar for incoming electrons and photons, 
but larger by about 15% for œ particles hitting molecular gases. 

For a given energy deposition, the number of ionisation pairs will be 
given by the distribution of the energy required to create the pair, W. If 
that would be the same for all collisions, no additional fluctuations to this 
number would be introduced, due to energy conservation. Without this 
constraint, the number of ionisation pairs will be characterised by a Pois- 
son counting statistics and the variance is (Nip). In practice, the situation is 
somewhere in between, which is customarily described by [242], 


o? = F (Nip), (2.44) 
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See chapter 4. 


See chapter 5. 


Even though the ionisation charge in a 
cluster is produced closely together, the 
separation between clusters is usually 
lost when the charges are drifting 
towards a readout electrode, due to 
diffusion in the gas (see section 4.1). 


Because of the large transferred energy 
this energy loss will be predominantly 
due to the second term in eq. (2.35). 


incoming 
cosmic 
particle 


6-electron 


1 mm 


Figure 2.19: Charge deposition from a 
cosmic particle track in a He/iso-C4Ho 
80/20 gas mixture. Note the ionisa- 

tion clusters and the presence of a 
6-electron [179]. 


10° 10° 10° 
kinetic energy [eV] 

Figure 2.20: Energy per ionisation 
pair as a function of the energy of 
the incoming particle for COz [141]. 
The full line shows calculation using 
GARFIELD. The shaded band indicates 
the high-energy limit. 
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where F is called the ‘Fano factor’. Its value is between 0 (no fluctuations) 
and 1 (Poisson distribution). At high incident energies the value of the 
Fano factor is constant, but increases toward unity as the initial energy of 
an incident particle becomes close to the ionisation potential of the mate- 
rial because at low electron energies the non-ionising collisions become 
more likely. 

The Fano factor is relevant whenever we measure an energy by mea- 
suring the ionisation charge in a detector, for example in the measurement 
of dE /dx (see section 12.3), or in photon spectroscopy (see section 11.1). 

The fluctuations of the energy deposition between ionisation and ex- 
citations become smaller for decreasing values of (W). The Fano factor 
empirically correlates with (W) /I, with J the ionisation energy, as [167] 


F= ggg —O:15: 

In mixtures of gases where one of the components has excited states 
with energies exceeding the ionisation threshold of another component, 
excitation energy in the former can be transferred to ionisation of the lat- 
ter, which will boost ionisation loss and hence reduce W and F (‘Jesse ef- 
fect’ [293]). In mixtures without such an energy transfer, they are, to good 
approximation given by the values in the pure gases, weighted by their re- 
spective concentrations. Accurate predictions for these two parameters in 
gases can be made using the energy-dependent collision cross-sections for 
the electrons generated in the ionisation processes with molecules in the 
gas. Such an approach is taken in the GARFIELD simulation software, 
using the underlying MagBoltz code, with good success [445]. 

In semiconductors (W) is much less dependent on the energy of the 
incoming particle (only at the level of a few %), because the energy loss 
process is dominated by collective lattice effects. W is correlated with the 
band gap energy Eg, and its value at high energies can be empirically pa- 
rameterised as [222] 

(W) =2E, + 1.43 eV. 


In semiconductors it is the Fano factor that shows a much larger en- 
ergy dependence for reasons not fully understood. It is also temperature- 
dependent with higher values at elevated temperature. Generally, Fano 
factors for semiconductors are lower than for gases. 
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Table 2.1: High-energy values for 
W and F for selected gases and 
semiconductors [222]. 


Gas Wie] W/I F 
He 42.3 1.72 0.21 
Ne 36.6 1.70 0.13 
Ar 26.4 1.68 0.14 
Kr 24.1 1.72 0.17 
Xe 21.9 1.80 0.13-0.17 
CH4 29.0 2.23 0.27 
C2H6 24.9 2.13 0.25 
Solid WIN] [W] F 

Si 3.62 1.12 0.06 
Ge 2.96 0.67 0.06 
GaAs 4.2 1.43 0.14 


In practical silicon detectors energy 
resolution is often not dominated by ion- 
isation statistics, but by other effects like 
lattice imperfections etc. Measurement 
of the Fano factor in silicon is therefore 
difficult. 


Figure 2.21: Fano factor for Silicon as a 
function of energy ([222] and references 
therein). 
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2.7 RANGE OF CHARGED PARTICLES IN MATTER 


When a beam of particles with kinetic energy T hits a slab of matter, the 
particles will lose energy at a rate given by the stopping power (dE /dx), 
as discussed previously in this chapter. If the slab is thick and dense 
enough, the particles will lose all their kinetic energy and come to a stop. 
The distance at which this happens is called the ‘range’ of the particles. 
Because of the fluctuations in the stopping power discussed in previous 
sections, not all the particles will stop at the same distance and the par- 
ticle density in the beam will go to zero over some finite length (‘range 
straggling’). In many cases the relevant criterion is the essential absence of 
particles at a certain distance, and thus it is common to describe the range 
either as the distance at which the particle density has decreased to a small 
value (e.g. 5%), or, alternatively, using the ‘practical range’ Rp, which is 
given by the distance where the tangent with the maximum slope to the 
density function becomes zero. 

For high energy incoming particles the range straggling will be small 
compared to the total distance travelled. Also, because of the increased en- 
ergy loss at low velocities, the energy deposition is largest shortly before 
the end point of the particle track. This results in an energy loss profile 
that peaks towards the end of the range, with a fairly sharp fall-off. This 
profile for energy deposition is known as the “Bragg peak’ [161] and is, 
for example, exploited in particle beam therapy [387]. It allows for a high 
dose to be deposited locally in the region where we wish to kill cancerous 
cells, while having low doses on the surrounding healthy tissue. 

The range of particles as a function of T scales again with By. At 
high energies two regimes can be distinguished. Below relativistic ener- 
gies (By < 1) the energy lost will be from the B~*-dependent part of the 
charged particle energy loss. Above that, it will be dominated by the rel- 
ativistic rise, resulting in a slower growth of the range with By. At lower 
energies (By < 0.1) the stopping power is smaller again, a fact that is not 
correctly reflected in the parameterisation eq. (2.31) of the Bethe equation, 
but which can be reproduced with the correct treatment of ionisation in 
MagBoltz. 

An empirical parameterisation that works for a large range of energies 


is [313] 
B 
oF) 


A = (1.06Z~°78 +0.18) x 107? g(cm? keV), 
B = 0.227725 4.0.79, 
C = (1.32°7 +0.21) x 1073 (keV) €, 


PRo.o5(T) =AT (1 = (2.45) 


with 


with Z for mixed materials and compounds replaced by its average value, 
weighted over the mass fractions. 

This allows for an estimate of the range of 6-electrons. Eq. (2.45) pre- 
dicts a range of 80 um for a 1 keV electron and 1.6 mm for 10 keV in ar- 
gon gas. In liquids or solids the range will be about three orders of magni- 
tude smaller because of the density. 
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The measurement of the range can be 
used to estimate the initial energy of 
the particle. This is used for example in 
large neutrino experiments to estimate 
the energy of muons which stop in the 
detector volume. 


R, R, 05 
Figure 2.22: Definitions for the range of 
charged particles in matter. 


This profile is different than for a pho- 
ton beam, where the intensity decays 
exponentially with depth. 


Deposited dose 


Distance 
Figure 2.23: Typical dose deposition as 
a function of depth for a proton beam. 


As discussed above, in Argon ~ 5 x 
1074 of the electrons from charged 
particle energy loss have an energy 
above 10 keV. 
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2.8 CHERENKOV RADIATION 


In section 2.4 we have seen that part of the transverse energy loss can 
be attributed to a term that turns on at a threshold given by Bmin = 1/n. 
While the interaction of the incoming particle with the charges in the de- 
tector medium can be usually described by the exchange of virtual pho- 
tons, real photons in the form of Cherenkov radiation can be emitted, if the 
material is transparent (€2 — 0). 

A simple picture of this effect is a luminar boom. The particle is seen 
as the source of spherical electromagnetic waves, which expand with the 
speed of light in the medium, c/n. If the particle moves faster than this, 
the spherical waves will build up to a conical wavefront with an opening 
angle given by cos Och = (c/n) /v = (nB)!. 

The requirement for optical transparency is usually only satisfied in 
the optical region (at energies below the threshold for photoionisation), 
at photon energies of a very few eV. In this region the index of refraction 
n œ | — £; increases with energy. 

From eq. (2.37) we can see that the energy lost to Cherenkov radiation 


is given by 
J 1 : oda 
2n(@)? i 
B>n(@)~! 


dE ah 
dx c 


(2.46) 


The energy density of the radiation is proportional to the frequency. The 
integral stays finite, as the index of refraction approaches 1 for @ — œ. 
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Figure 2.24: Range of heavy charged 
particles in different materials (modified 
from [379]). M is the mass of the 
incoming particle, and p the density of 
the absorber. 


Ei cf 
of 5 
g 
OE 7 
3 
Kr e 
Qa Pa 
o d 
1F a 4 
8 
58 
F a8? 
107e Boat f x 
PNE 
kinetic energy [eV 


Figure 2.25: Measurements (squares) 
and MagBoltz calculations (circles) 
of the practical range of electrons 

in methane (at atmospheric pres- 
sure) [141]. 


Figure 2.26: Cherenkov cone. 


Eq. (2.46) is known as the Frank-Tamm 
equation [469]. 


As the index of refraction increases 
with energy the number of photons 
with a given wavelength is shifted 
towards the short wavelength end of 
the spectrum. This, together with the 
spectral sensitivity of the human eye, 

is the reason Cherenkov radiation from 
nuclear reactors immersed in water 
appears blue. 


Interactions of particles with matter 


The number of photons N = E/ha, and is then 


1 
(~ grate) = 
~ sin? Ocy (Ey) dEy(€V)~!em=!, (2.47) 


Ema 
~ 3702 f 
E 


min 


dN 2a Emax 
dx fic JE 


min 


where Ey is the photon energy. 

The amount of energy lost to Cherenkov radiation is only a small frac- 
tion of the charged particle energy loss, but it is the long range of the 
Cherenkov photons in transparent detectors, and the correlation of their 
direction with the direction and speed of the incoming particle that leads 
to this effect being used in a whole range of particle detectors. 


2.9 TRANSITION RADIATION 


As discussed in the previous section, in a material with index of 
refraction n > 1 Cherenkov radiation is emitted with a defined angle 

Och = acos(1/nB). However, if the material is contained within a lim- 
ited thickness L ~ A, the superposition of radiation from emission sites 
along the limited track length of the incoming particle in the thin slab of 
material will lead to diffraction of the Cherenkov radiation, and thus of a 
broadening of the emission angle [292, 48]. 

The difference of the phases of the radiation emitted at the start of the 
radiator (point A) and the end (point B), when it hits the line BC, is given 
by ®(0) = $a — Prg = (27//) [Leos 6 — L/(Bn)}. The diffraction spreads 
the emission in 0 around @c¢y, which we account for by using a double 
differential d?N /(d@dQ). The transition from a large slab of material to a 
thin one can then be described by 

1 
P) 


The broadening by the diffraction has the interesting consequence that 
real photons can get produced even if the Cherenkov condition Bn > 1 is 
not met. 

Outside of the thin layer is vacuum, and thus the radiation gets re- 
fracted when it exits the layer (sin n =nsin@). Hence we obtain 


dN 2aL 
dadQ  2rc 


> = ES ee 
2nc À 


sin20 5 (cos o za L into (“one 2) ) 7 


aN za 
dadQ! ~~ 22@n? 


(2.48) 


The sin? (®(@) /2) factor in the numerator can be interpreted as the 
interference of radiation emitted from the front and back faces of the di- 
electric. It goes to zero for L — 0, as necessary. 

However, eq. (2.48) cannot be complete, as it would predict the emis- 
sion of photons even in the case that the slab would be filled with vacuum 
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The integral extends over the energies 
where photons are observed, i.e. where 
Cherenkov radiation is emitted, but also 
where the photon detector is sensitive. 


This expression does not depend explic- 
itly on the density. However, typically 
the index of refraction, and thus the 
Cherenkov angle and the number of 
photons emitted per unit track length, is 
larger for denser materials. 


If we assume detection of Cherenkov 
radiation in the spectral range of visible 
light (Ey ~ 1 eV), the energy loss is 
about 1 keV x 2 sin? 8c per cm, about 
10-3 of the total charged particle 
energy loss in a solid. 


Figure 2.27: Geometry for Cherenkov 
radiation emission in a thin slab of 
material. 


The interference pattern is similar to 
that from a finite slit ( sinc”), because 
in both cases the pattern can be seen as 
the result of the superposition of a con- 
tinuous, uniformly distributed oscillator 
manifold with linearly changing phase. 


We ignore the effect of reflections. 


dQ! = 2nsinn dn. 
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(n = 1). We can correct this by applying the superposition principle. In- 

troducing the slab of material has replaced a corresponding layer of vac- 
uum and we have to subtract the equivalent amplitude for this layer, with 
n= 1 [48], 


ËN Za sin? n sin? OF y 2 _ sin? l x 
— n 
dodQ’ mon” Zc 1-3 


1 1 
n-5 cost] — 5 


2 


(2.49) 


The emission of photons described by this equation is called ‘transi- 
tion radiation’ (TR), as it requires the transition of the charged particle 
between layers of dielectric material and vacuum. 

The first term in the bracket in eq. (2.49) is singular for the Cherenkov 
condition in the dielectric. It is relevant for radiation in the optical fre- 
quency region [150]. There €; > 1 and £2 œ 0. In the resonance region 
above that the strong absorption prevents the propagation of real photons, 
but at very large energies (X-rays) the absorption length becomes macro- 
scopic again. This is the region where the second term becomes important, 
which is singular for very small angles and close to 1. In that region 

o2 
n(@)=1— D 


In these conditions we can also approximate 


2 
1 1 
sinn ~ N,cosn ~ 1 T 'B SEFE 
Eq. (2.49) then becomes 
PN 4an? | oL fo 1 
5 Se q sin? +n’ EF x 
dadQ T-O 4c \@ y 
2 
‘ 1 1 

o2 ERN 
ee ee 


If we remove the interference term we obtain the flux from a single 
face transition 


PN an? 1 1 


> 
doadQ’ mo hint m+ 


Integration over the energy gives the angular distribution of the single 
face radiated energy density J = Nh@) 


dl aha, n? 
dn 2 5/2° 
i (+n?) 
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The interference term remains the same 
as we are replacing a slab of vacuum 
with the equivalent slab of dielectric. 


The plasma frequency @, is defined in 
eq. (2.22). 


Here we have also approximated one 
5 sii 
factor of n? with 1. 


We have assumed vacuum outside of 

the dielectric slab. However, dielectric 
properties of the material outside can be 
accommodated by adding another fac- 
tor oz /@ with the plasma frequency for 
the outer material in the denominator of 
the second term of this equation. 

We use the small angle approximation 
dQ! ~ 270 d0. 
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As expected, the emission of X-ray transition radiation photons is 
close to the direction of the incoming particle, with the maximum at 


Nmax = Vy 3/27 !. 
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The intensity as a function of the energy scales with y@, and drops 
sharply beyond y@, ~ 1. To achieve significant intensities in the X-ray 
regime, where long distance propagation of the photons is possible, does 
require a significant Lorentz boost (y 2 100) for typical values of the 
plasma frequency. 

The total radiated energy at a single face is given by J = Zaho 13, 
increasing linearly with y. The total number of photons diverges for 
@ — 0, which is a consequence of our approximations in this region. 
However, in practical applications there will be a low energy cut-off, 
which we can use for the start of the integration. In this way we obtain 
N(@ > 0.15y@,) ~ 0.5z*a@ for a single face transition [71]. The small 
photon yield per face transition is the reason practical TR detectors com- 
prise a large number of dielectric layers. 

At this point we need to take into account interference and absorption. 
The former is described by the interference term. One implication of this 
term is that the dielectric layer needs to be sufficiently deep so that radi- 

ation can develop. The length scale for which this happens is called the 
‘formation zone’, and is given by 


—1 
„Ar 
~ 3m’ 


(Q 


@2 
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j y 


where we used the characteristic values @ ~ Y@, and n ~ y~! for the 
approximation. A consequence of the interference is that the transition 
radiation intensity for a foil does not continue to linearly increase with y, 
but saturates around y ~ \/3aL/A. While it appears therefore beneficial 
to increase the thickness of the dielectric, and also to increase the number 
of interfaces to increase the number of photons produced, in practice this 
increases absorption, and thus for practical detection of transition radiation 


a suitable compromise has to be sought. 
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Figure 2.28: Left: Single face TR 
intensity as a function of angle. Right: 
Single face TR intensity as a function of 
energy. 


This expression diverges for @ — 0. 
This is an artefact of the approximations 
used for n. 


There is no threshold for the turn-on of 
transition radiation. 


Again, the energy lost to transition 
radiation is a small fraction of the 
charged particle energy loss. It is useful 
for particle detectors because of the 
macroscopic penetration capabilities of 
X-ray photons, and the dependence on 
the y factor of the incoming particle. 


The interference term is then 
4sin(L/Ly). 


The formation zone is typically a few 
10 um. 
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2.10 MULTIPLE SCATTERING 


So far we have discussed the energy loss of charged particles due to in- 
elastic collisions with the electrons in the detector material, which ulti- 
mately generates the signal we require for observation. However, charged 
particles also interact with the nuclei in the detector. While such colli- 
sions generally do not lead to a detectable energy transfer, they can have 
significant effects on the particle trajectory, which has the inconvenient 
consequence of degrading the knowledge of the original direction of the 
particle. 

Depending on the number of scattering events, we distinguish three 
regimes: Single scattering, plural scattering (from 2 to about 20 scatters) 
and multiple scattering (more than 20 scatters). 


Single scattering 


In principle this type of scattering is described by the Rutherford scatter- 
ing differential cross-section 


do _ (“ane l Ge 1 

dQ \ 2Bcp ) sint (4) \ Bep Ai 
which diverges for very small scattering angles. However, in matter the 
reach of the interaction is reduced by screening of the nuclear charge by 
the shell electrons. This reduces long-distance, small-angle scatters and 


effectively introduces a lower cut-off angle Xa, with the differential cross- 
section becoming 


d 22Z2æħc\? 1 
o (= me) (2.50) 


d Bep / (+r 
Various calculations of the screening angle exist, but an important model 
by Molière [375] uses the Thomas-Fermi model to obtain 


2 
zZ 
=x 1.13-+3.76 ( =) 


with 

à 1 Az” 

a 0.885p a 

where a is the Thomas-Fermi atomic radius, ag = ñ / (meca) the Bohr 
radius, and p and p the momentum and speed of the incoming particle. 


xo = 
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Figure 2.29: Saturation and absorption 
of X-ray transition radiation [246]. 
Left: Intensity of the forward radiation 
(divided by the number of interfaces) for 
20 um polypropylene (h@, = 21 eV) 
alternated with 180 um helium 

(h@ = 0.27 eV). Right: Total forward 
intensity per interface for the same 
radiator. 


We assume that the scattering is sym- 
metric around the direction of the 
incoming particle, i.e. on spherically 
symmetric atoms, or, for multiple scat- 
tering, that asymmetric scatters are 
randomly oriented. 


It is common practice in the discussion 
of multiple scattering to denote the 
polar angle for single scattering by x, 
and for multiple scattering by @. 


This is sometimes called the ‘screening 
angle’. 


dQ = siny dydọ ~ xdxd¢. 
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Plural scattering 


There is no quantitative model for this, but it is usually treated as a transi- 
tion between the two other cases and results are interpolated. 


Multiple scattering 


Multiple scattering distributions are the result of a large number (>20) 

of collisions. In principle, one would assume that in accordance with the 
central limit theorem, the distribution of the overall scattering angle after 
multiple scattering is Gaussian. However, the single distribution scattering 
distribution is not falling quickly enough for large scattering angles for 
this to fully apply, and the observed distribution is dominated by a central 
Gaussian, with non-Gaussian tails at larger scattering angles, due to the 
rare large-angle scattering events. 


ae 18-66 MG /oy? Au 


t-—37. 28 MG /oy? Au 


GAUSSIAN 


FRACTIONAL SCATTERING PER SQUARE DEGREE 
5 
fS 
TA Ins E Se SL 


20 25 30 


5 10 15 
SCATTERING ANGLE IN DEGREES 


The probability density function f(0,t) that a particle has a direction 
given by the polar angle 0 after passing a thickness ¢ of the material must 
satisfy a transport equation 


af(@, d 
AOD ane fire- - £00, 


dx, 


where dø /dy is the single scattering cross-section given in eq. (2.50) af- 
ter integration of @, and ną is the number of nuclei per unit volume. In the 
Molière theory of multiple scattering [376] a characteristic single scatter- 
ing angle 

aa) 


2 
Xe =4anat ( 
Bep 


is used to define the parameter 


2 
7 x 
b=n( zz) 


where the argument in the brackets is the effective number of collisions in 
the target. The solution to eq. (2.51) can now be found as a power series in 
B~', where B is defined as the solution to the equation b = B — In B. The 


(2.51) 
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Figure 2.30: Angular scattering distri- 
bution of 15.7 MeV electrons from thick 
and thin gold foils [278]. 


The physical meaning of Xe is that on 
average a particle observes one scatter 
with a scattering angle larger than %e. 


In the discussion here we have ignored 
energy loss of the incoming particle, but 
Molière also describes a modification of 
Xa and %e to account for this [376]. The 
remainder of the treatment is the same. 


Typically, B is around 5 for thin ab- 
sorbers and increases logarithmically 
with target thickness to about 20. 
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distribution of projected scattering angles can then be expressed by the 
first three elements of the series 
Ee 


1 Po (0) (9! £0’) 
fede |g 6). 


dé’, (2.52) 


with 0’ = Oproj/ (Xc VB) a reduced scattering angle in units of y.VB, and 
the distribution functions f™ (0') defined by 


2 pe anal, PAT 
(n)(g’) — A. —y-/4 y x 1 
f™ (8) i Ra E In ( 3 )| cos(y@") dy. 


These distribution functions are generally not trivial, but the first (and 
dominating) one is simply a Gaussian, 
Oro 
x 


=- g cB 


In Oproj the standard deviation of this Gaussian is 7, \/2B. The other terms 
in eq. (2.52) describe the non-Gaussian tails of the distribution. 

Usually, the direction of the scattered particle is given in 3D space, 
and is symmetric around the direction of the incoming particle, so that 
multiple scattering distributions are depending on the polar angle Opace. 
However, it is often the projection onto a plane containing the direction of 
the incoming particle that is relevant. In that projection the direction of the 
scattered particle is given by an angle proj- 

The non-Gaussian tails of the multiple scattering distribution are usu- 
ally very small (at the level of 107°), and thus can be neglected for the 
estimation of errors in tracking in particle physics experiments. It is there- 
fore common to use the width of the Gaussian core as the sole measure to 
parameterise the effects of multiple scattering on tracking. Early on [433] 
it was noticed that the scaling by (N4/A)Z? invites the use of the radia- 
tion length Xp as a scale variable to describe multiple scattering. High- 
land [286] developed a parameterisation for this, for which Lynch and 
Dahl [353] have found improved parameter values which are today gener- 
ally accepted as a standard description of multiple scattering [505] 
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where x is the distance travelled through the material. 

Moliére theory (and other descriptions of multiple scattering) makes 
predictions of the scattering angle but not for the displacement of the par- 
ticle after the passage through a given slab of material. For the simula- 
tion of the impact of multiple scattering on tracking performance the po- 
sition of the particle along its trajectory must be determined. Two main 
approaches are used: In ‘detailed’ simulations a complete sequence of in- 
dividual scatters is generated. The results from such a calculation is accu- 
rate, but due to computational demands this method is typically limited to 
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Figure 2.31: The first three distribution 
functions f™ (6). 


Numerical values for the distribution 
functions have been given in Molière’s 
original paper [376] and in a follow-up 
paper by Bethe [131]. Simple empirical 
representations are given in [248]. 


That plane, for example, can be the 
bending plane of a magnetic spectrome- 
ter. 


In particle therapy the tails cannot be 
neglected. 


Multiple scattering is not a radiative 
process. Nevertheless, the radiation 
length is still a useful parameter here, 
as the underlying electromagnetic 
interactions are similar. 

It is amusing that Lynch and Dahl in 
their paper propose a parameterisation 
they consider better than the Highland 
form (and is not relying on the radiation 
length). However this parameterisation 
has not won general acceptance. 


ia is the standard deviation of the 
3D spatial scattering angle distribu- 
tion, which for small angles can be 


approximated by 


P(0)dQ = 


G is the standard deviation of the 


angle in a projection of this distribution 
onto a plane containing the direction of 
the incoming particle. 
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a few hundred scatters. The second approach are ‘condensed’ simulations, 

which make use of a multiple scattering model like Molière theory. The 

track is divided into segments (steps) that are still long compared to the 

mean free path, and the displacement due to multiple scattering for each 

segment based on the underlying multiple scattering theory is computed. 

The algorithms used for this are not exact and the main source of errors 

in these Monte Carlo codes. In particular, results can depend on the step 

size, and only become stable if these are sufficiently small, thus becom- 

ing computationally demanding. More recently, ‘mixed’ approaches are For an overview of the methods em- 
Š i r i 3 ployed by GEANT see ref. [478]. 

being explored, which combine condensed simulations for small scatter- 

ing angles with a limited number of detailed calculations of large angle 

scatters [248]. 


2.11 INTERACTIONS OF ELECTRONS WITH MATTER 


Electrons as charged particles will lose energy in interactions with shell 
electrons similarly to other charged particles. In principle, this interaction 
follows the same principles as outlined previously for other charged parti- 
cles, with adjustment for the kinematics and interaction cross-sections. 

In addition, as electrons have a low mass they do have sizeable cross- 
section for bremsstrahlung. 


CRITICAL ENERGY 


If we compare the energy loss by bremsstrahlung, eq. (2.13), to the en- 
ergy loss to ionisation and excitation as, for example, given in eq. (2.31), Tay 
we note that the radiative energy loss scales dominantly with Z compared sue 
to Z, as it is due to an interaction with the nuclei rather than the shell elec- Bg LO ee E 
trons. The radiative energy loss increases with the energy E of the incom- Š 0.8 
ing particle. Hence it will dominate at high energies. The energy at which 8 0.6 
the radiative energy loss exceeds the loss to ionisation and excitation is “oa — > 
called the critical energy Ee. 02 Sow 

The exact definition of Ee is ambiguous. Two definitions exist, one i : baal Se f 
where the ionisation energy loss equals the radiative energy loss at the - G mav? 19 
critical energy, Figure 2.32: Fractional energy loss to 


dx 


energy loss by radiation for electrons 
dx per radiation length of air and of 
E=Ec lead [433]. 


the other where the ionisation energy loss equals the radiative energy loss 


at high energy, 
(Ey (57 7 E 
dx J Eo dx / Eo Xo l 


The latter is the one recommended by the PDG [505] and we follow that 
recommendation. 

The critical energy can be parameterised as 710 MeV/(Z + 0.92) (for 
gases) or 610 MeV/(Z + 1.24) (for solids) to within a few %. For muons, 
the lightest charged particle with a mass above the electron, the critical 
energy is of the order of TeV. 

The bremsstrahlung radiation spectrum is extending over the full range 
up to the energy of the incoming particle. It peaks for small energies. The 
probability for a large energy transfer is driven by screening effects, with a 


2 


( dE ) ion ( dE ) rad ionisation and excitation, and fractional 


E=E, 


200 i 400 rz 
Copper k Wa | 
Xo = 12.86 g cm~? 200 = i) 
100 $- E,=19.63 MeV © < 
> E S Ws 
Z o- S ea Se nome | 
£ Fa nS | aa Dy e’ J 
of 50E S 4 85 6loMev 7% Ss y+ 0.92 ] 
dD E D S eh Z+ 1.24 Stina 7 
x aS S J oy DS 
< 36 4 ` | 
m J 20 + Solids 
D ization J 20 [- S WO ~ 
m Ionization 4 © Gases a 
i 10 — sy 
l J He Li BeB CNONe LP D 
10 
5 10 20 50 100 200 3 1 2 5 10 20 50 100 
Electron energy (MeV) vA 


higher probability for the emission of a high-energy photon for incoming 
particles with higher energy. 


E/EXfE,E,) 


Classically, the radiation in the radiation zone is a dipole radiation in 
the frame of the electron, with the maxima perpendicular to the acceler- 
ation. Lorentz transformation into the lab frame gives a narrow, but not 
sharp, cone of radiation with a cone angle of 6 = 1/y. 


2.12 HADRONIC INTERACTIONS 


While the processes involved in electromagnetic interactions are well un- 
derstood, hadronic interactions are significantly more challenging, as they 
involve complex nuclear physics and low momentum transfers, so that per- 
turbative QCD is not applicable. 

Before we consider interactions of hadrons with nuclei, we will first 
consider the simpler case of hadronic interactions on protons. The total 
cross-section is found to change slowly with the energy in the centre-of- 
mass system. 

There is a significant fraction of elastic scattering. The inelastic cross- 
section is dominated by processes with low momentum transfer and there- 
fore cannot be predicted using perturbative QCD and we rely on simple 
phenomenological models. The most crucial empirical observation is that 
the secondary particles have low transverse momentum pr (relative to the 
direction of the incident hadron), with a mean value of (pr) ~ 400 MeV/c. 

In the longitudinal direction (parallel to the incident hadron) the sec- 
ondary particles are distributed approximately uniformly in rapidity 


1 ee 
ia a (2.54) 
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Figure 2.33: Left: Electron energy loss 
due to bremsstrahlung compared to 
energy loss due to ionisation (modified 
from [379]). Right: Critical energy for 
materials with different Z [505]. 


Figure 2.34: Differential radiation 
probability per radiation length in 
lead for electrons of various energies 
(modified from [432]). The vertical 
scale is defined by 


dE E E, 
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Figure 2.35: Bremsstrahlung emission 
direction in the electron frame (left), 
and in the lab frame (right). 

This is again an example for the head- 
light effect (see exercise 4). 


In the lab frame elastic scattering 
results in energy transfer from the 
incident proton to the target proton 
and therefore does result in a hadronic 
secondary. 


The mean transverse momentum (pr) 
increases with the centre-of-mass en- 
ergy (\/s) and can empirically be pa- 
rameterised as [195] 


(pr/(GeV/c)) = 0.413 — 
— 0.0171 In(s/GeV?) + 
+0.0143 In?(s/GeV’). 
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where £E is the energy and py is the momentum component in the direction 
of the incident hadron. It can be shown that at high energies (compared to 
the mass) the rapidity approaches the pseudorapidity defined by 

n = —Intan (0/2), (2.55) 
where @ is the angle to the incident particle. 

The rapidity increases by a constant value in Lorentz boosting from 
the centre-of-mass to the lab frame. Therefore, in the lab frame the sec- 
ondary particles produced in the hadronic interaction will tend to have 
large values of 1), i.e. they will be at small values of 0. This implies that 
the secondary particles tend to be emitted in a narrow ‘jet’ of particles in 
the direction of the incident hadron. 

The interactions of hadrons with nuclei have many additional features. 
The incident hadrons can interact with the entire nucleus (‘coherent scat- 
tering’), but the most important process is still scattering off an individual 
nucleon. This will result in secondary hadrons in a very similar way to that 
for a proton target. The secondary hadrons can interact with other nucle- 
ons in the same nucleus and create an intra-nuclear cascade. Some of the 
secondary nucleons or created hadrons can also escape the nucleus. The 
nucleons that are captured will undergo further collisions such that the 
nucleus ends up in thermodynamic equilibrium in a highly-excited state. 
This excited nucleus will decay by emitting nucleons or light nuclei (e.g. 
deuterium, tritium or helium), a process that is called ‘spallation’. Some of 
the incident energy ends up in nuclear recoil. 

At the end of the spallation process, the remaining excitation energy is 
insufficient for more nucleons to escape and the nucleus fully de-excites 
by emitting ys with an energy of ~MeV [342]. 

Energy is conserved in these processes but the energy used to release 
nucleons from the nuclei is effectively lost for detection. An additional 
part of the energy that escapes detection is due to neutrinos from the weak 
decays of pions and kaons (e.g. T — Vy). 

Hadronic interactions are primarily important for the detection of neu- 
tral hadrons, as they do not directly lose energy to ionisation. It is rather 
the ionisation generated by the secondaries created in the hadronic interac- 
tions, or the nuclear recoil, which we can use for detection. The absence of 
ionisation loss also accounts for the high penetration capabilities of neutral 
hadrons. The most important neutral hadron, of course, is the neutron. 
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Figure 2.36: Total cross-section for 
the scattering of pp and pp (modified 
from [379]). Cross-sections for other 
hadronic projectiles look similar 
(see [515]). 


See exercise 9. 


This is the compound nucleus model. 


Spallation is used to create very intense 
beams of neutrons in spallation sources, 
such as ISIS at the Rutherford Appleton 
Laboratory, U.K. 


In very heavy nuclei like uranium, low 
energy neutrons can induce fission, 
which results in the emission of more 
low energy neutrons. 


This is a small effect in dense materials 
used for hadron calorimeters at accel- 
erators but is a very significant effect 
when studying the hadronic showers in 
the atmosphere induced by cosmic rays. 
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LENGTH SCALE OF HADRONIC INTERACTIONS 


The ‘nuclear interaction length’ is defined as the mean free path of a 
hadronic particle before undergoing an inelastic collision. It is given by 


1 Ma 
~ Napoin’ 


Aint = (2.56) 


NOin 
where Oj, is the inelastic cross-section, n is the volume number density 
of the nuclei, m, their atomic mass (in g/mol), p the mass density and Na 
is Avogadro’s number. As discussed in section 2.12 the typical hadron- 
hadron cross-sections only vary logarithmically with energy. 

We can make an approximate estimate of Ain: based on the scaling of 
the radius of nuclei with atomic number A and assuming that the cross- 
section is given by the geometrical cross-section, © = 7R*, with the nu- 
clear radius R = RA! and Ro = 1.2 fm. 

The values of the interaction lengths can be calculated more accu- 
rately. The nuclear interaction length is conventionally defined by inter- 
actions of neutrons with a momentum of 200 GeV/c. At high energy the 
nuclear interaction length changes very little with momentum. The dif- 
ference between neutron and proton interaction lengths is negligible in 
this energy range. However, z~ have significantly smaller nuclear interac- 
tion cross-sections, and thus the interaction length for these particles are 
longer. 

As we will discuss in chapter 11, the radiation length Xo and the nu- 
clear interaction length Aint define the dimensions of high-energy electro- 
magnetic and hadronic calorimeters, respectively. These two length scales 
have a different dependence on the size of the nuclei in the target mate- 
rial. While pXo with the mass density p scales with A/Z? (eq. (2.17)), i.e. 
decreases for larger nuclei, PAint scales with A!/3 and thus increases for 
larger nuclei. The two are comparable for small absorber nuclei, but for 
large absorber nuclei, nuclear interaction lengths are typically an order 
of magnitude longer than radiation lengths. In chapter 12 we will discuss 
how we can use this difference to our advantage for electron/hadron sepa- 
ration. Exercise 10 compares these measured values with simple estimates 
based on the atomic number. 
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There is also the ‘nuclear collision 
length’, which is smaller than the nu- 
clear interaction length because it 

also includes elastic and quasi-elastic 
(diffractive) reactions. 

Here we focus on the interaction length, 
as it is the relevant scale length for 
creating secondaries that result in a 
signal in a detector. 


Another common estimate for the inter- 
action length uses the black disk limit 
o(hN) = o(hp)A2/?, where o(hN) 
and o(hp) are the cross-sections for 
the strong interaction of an incom- 
ing hadron with a nucleus and a pro- 
ton, respectively, and the observation 
that o(hp) is about 37.5 mb (see Fig- 
ure 2.36), so that in the energy range 
1 to 1000 GeV 
re 35 AB. 

p/(g/cm’) 
with the mass density p. The error 
associated with these approximation 
can be estimated from Figure 2.36. 


(2.57) 


Table 2.2: Comparison of interaction 
lengths Ain with radiation lengths Xo 
for common solid absorber materi- 
als [379]. 


Element Xo [cm] Aint [cm] 
Fe 1.76 16.8 
Cu 1.44 15,3 
Pb 0.56 17.6 
U 0.32 11.0 


Figure 2.37: Nuclear interaction length 
pa and radiation length pXo for differ- 
ent elements (data from [271]). To get 
to a length in centimetres, the value in 
the plot needs to be divided by the mass 
density p in g/cm’. 
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2.13 INTERACTION OF NEUTRINOS WITH MATTER 


Neutrinos (and anti-neutrinos) can only interact with matter by weak 
interactions. They can do that either via charged current or neutral cur- 
rent interactions. From dimensional analysis the cross-sections will vary 
with centre-of-mass energy (v/s) as o © sGz/(hc)*, where Gp is the Fermi 
coupling constant. 

In the lab frame, if the energy of the neutrino is E and the target mass 
is m, then s = 2mc?E. We can then see that the neutrino cross-section for 
Ve scattering will be three orders of magnitude smaller than for scattering 
of a neutrino off a nucleon. The consequences of these very small cross- 
sections are explored in exercise 11. 

For charged current neutrino interactions we need to consider different 
types of processes as the momentum transfer (Q?) increases: 


e At very low values of Q? we have quasi-elastic scattering in which 
the target nucleon remains intact and no additional particles are cre- 
ated, e.g. Vep > ne*; 


e At higher energy we can have coherent production of mesons, but 
the target nucleon remains intact, e.g. ven > e~ pn; 


e Atenergies Ey ~ 1 GeV we can have nuclear resonances, e.g. the 
A in reactions like Vyn — uA, which subsequently decays, 
eg. A nt; 


e For higher energies we enter the Deep Inelastic Scattering (DIS) 
region. In DIS the neutrinos interact with a single quark in the nu- 
cleus, allowing for QCD theoretical predictions. 


In addition, the struck nucleon can have final state interactions with other 
nucleons in the nucleus in a similar way as we discussed for hadronic in- 
teractions. 

The one difference for neutral currents is that at low energy we can 
have elastic scattering instead of quasi-elastic scattering, e.g. Vue — Vue. 

In all instances it is the charged particles (or photons from 2° decay), 
produced or liberated in the primary interaction of the neutrino, that inter- 
act with the detector material in a way described previously in this chapter 
and thus create a signal. 
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Figure 2.38: Neutrino charged current 
(top) and neutral current (bottom) 
reactions. 

This will turn out to be invaluable for 
detecting low energy Ve. 
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Key lessons from this chapter 


e There are three dominant ways in which photons lose their energy in matter, photo- 
electric effect (dominating at low energies), Compton scattering (dominating around 
1 MeV), and pair production (dominating at energies >1 MeV). The absorption cross- 
section for the photoelectric effect does increase strongly with Z, while the increase is 
more moderate for pair production, and much smaller for the Compton effect. 


e Charged particles lose energy in electromagnetic interactions with the electrons in 
matter. The mean energy loss can be described by the Bethe equation. It is charac- 
terised by a drop œ B7? at low energies, a minimum at By ~ 4 (which is referred to as 
a minimum-ionising particle), and a logarithmic rise at higher energies. This increase 
is larger for less dense material and the measured energy depends on the thickness of 
the material. 


e The energy transfer from a charged particle to the material is happening through a 
series of small-energy transfers. The distance between collisions and the energy trans- 
ferred in each collision have stochastic distributions, and thus the energy deposited 
in a finite slab of material will display a straggling distribution with finite width and 
a long tail to high energies. The energy transfer can result in ionisation and/or excita- 
tion of the absorber material. 


e lonisation electrons liberated in very high energy energy transfers (>keV) are 
called 5-electrons. 


e Other forms of charged particle energy loss are Cherenkov and transition radiation. 
They only constitute a small fraction of the charged particle energy loss, but under 
certain conditions they can result in the production of long range photons that can be 
used for detection. Also, the properties of this radiation does depend on kinematic 
properties (B or y) of the incoming particle, which thus becomes accessible for mea- 
surement. 


e The most powerful approach to calculation of charged particle energy loss are dielec- 
tric models of the energy transfer like the PAI model and its extensions. 


e Electromagnetic interactions with the nuclei in the material will result in multiple 
scattering of the particle, which does not lead to energy loss in the material, but a 
deflection of the particle trajectory. 


e Light charged particles (primarily electrons) will also lose energy to bremsstrahlung 
in interactions with the nuclei. 


e Bremsstrahlung is a closely related process to pair production and thus they can be 
described by the same length scale variable, the radiation length Xo. The radiation 
length scales with p~! and Z*/A (for the dominating interactions with the nuclei in 
the matter). The radiation length is a property of the absorber material. 


e While not a radiative process, the similarity of the interaction with the nuclei 
in the material also makes the radiation length a useful length parameter to de- 
scribe multiple scattering effects. 
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e Hadronic interactions are considerably more complex, but we also assign a character- 
istic length to hadronic interactions, the nuclear interaction length Aint, which is again 
a property of the material. For high Z the interaction length is significantly longer 
than the radiation length. 


e Neutrinos interact with the detector through weak interaction processes, and the ac- 
tual signal in the detector is provided by the charged particles created or liberated in 
this interaction. 


EXERCISES 


1. Energy dependence of the photoelectric effect. 


The rate of scattering of a photon off an atom can be found from Fermi’s Golden Rule 


2 
P= |(s1H") PoE), 


with the interaction described by a perturbation Hamiltonian H’ and the density of states p (E). 
The photon can be described by the vector potential A=Aoé exp(ik - F), and the interaction Hamil- 
tonian H’ = eA- p/m. 


a) Evaluate the matrix element for the interaction with a hydrogen atom in the ground state 
(li) = (zał) !/7exp(—r/ao), with the Bohr radius ao) and the assumption that the final state 
can be described by a free electron wave function |f} = exp(ip-7/h). (Use ġ = (Ak — p)/h to 
describe the momentum transfer to the atom.) 


b) For high energy photons (well above the binding energy of the atom) the electron momen- 
tum will equal the momentum transfer. What will be the angular distribution of the electron 
emission? What approximation can you make to the result from part a)? 


c) Show that the density of states is proportional to p. 
d) Find the differential cross-section do /dQ =T /(ħ@Aĝ) and the cross-section o and demon- 
strate that the latter in this approximation is proportional to g~’, or Ey K 
2. Compton scattering. 
Verify eq. (2.10). Hint: use 4-momentum conservation to “isolate and square” the 4-momentum of 
the scattered electron. 


3. Weizsäcker-Williams model (see also [498, 290]). 


a) Show that the field of a point charge moving in the x; direction passing a system S at a dis- 
tance b from the direction of motion at the location of the system is given by 


_ qyb 
E(t) = (b2 + 72v272)3/2’ 
B3(t) = BE2(t), 

= qyvt 
E\(t)= (b2 + y2v242)3/2’ 


with other components vanishing. 
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b) For P ~ 1, E2(t) and B3(t) are equivalent to a pulse of plane-polarised radiation in the direc- 
tion of the motion. E; (t) is technically lacking a B field to form a similar transverse pulse. 
However, if the motion is non-relativistic in the frame of S, one can add the missing B field 
without changing the physics, as the particles in the system only respond to electric forces. 
In the Weizsacker-Williams model the particle is seen as accompanied by a cloud of virtual 
photons. The frequency spectrum of these virtual quanta for the fields in part a) for a given 
impact parameter b is given by 


di(@,b) c 2 
Ne ee 
Je zz E 


where E (œ) is the Fourier transform of the transverse field in the pulse. 

Find the Fourier transform of the electric fields and hence the frequency spec- 

trum of the pulses. (You will need that [2 exp(iax) /(1 +.x?)?/? dx = 2aK; (a) and 
[2n xexpliax)/(1 +x?) dx = 2iaKo(a), where the K; are modified Bessel functions.) 


To find the frequency spectrum of the radiation pulse for the interaction with matter we 

have to integrate this over all impact parameters from a minimum impact parameter Dyin. 
From the uncertainty principle bmin = /Qmax-. Show that for bremsstrahlung this is given by 
bmin = h/(2Mv). 

Find d/(@)/d@ = f,”. di(@,b)/d@. What are the limits for this spectrum for low and high 
frequencies? 


c 


wm 


d 


T 


4. The ‘searchlight’ (or ‘headlight’ ) effect. 


a) As derived in exercise 3 part d), the high-frequency tail for the spectrum for B — 1 is given 
by 


dn(m) 1 di(@) @ — 22% min 
do fa do o 

b) The critical frequency @, describes the end point of the spectrum. Show that this frequency 
can be described by @ ~ Y¥c/Dmin. If the radiation is seen as a beam with waist bmin, argue 
that the opening angle is given by y~!. 

5. Derivation of the Thomas-Reiche-Kuhn (TRK) sum rule. 
Start from the definition of the dipole oscillator strength, eq. (2.4). Show that this can be written as 
i 


Sn = 35 


[Po] D Pn) Pn Po) — (Polx| Bn) (Pn P| %o)] , 


then use the closure relation X, | Yn) (Yn| = 1 for the hydrogenic wave functions to show eq. (2.7). 


6. Derivation of the Bethe equation for energy loss using non-relativistic quantum mechanics 


(NRQM). 
If you are already familiar with the NRQM derivation of Rutherford scattering start at b). 


a) Consider the scattering of a charged particle with mass M on an electron with mass me for 
M > me. The differential scattering cross section is given by 


do | m= |V(q) 
dQ 4n2| 7 
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where the matrix element for Coulomb scattering is 


e2 


4mteor’ 


V(q)= | Prexe(-ig-7) 


where g = py — pj and pẹ (pj) is the final (initial) momentum. Show that the differential 
cross-section is 


do e? > 
d (a sin? TA l 
where @ is the scattering angle and p the momentum of the incoming particle. 
b) Let p be the magnitude of the 3-momentum of the incident charged particle in the rest frame 
of the electron before the collision and v be the speed of the incident charged particle. Show 


that in the frame at which the electron was initially at rest, the kinetic energy of the scattered 
electron is T = (p? /m)(1 — cos 0). Hence show that 


do 2me? 


dT mv2T’ 
c) The rate of energy loss per unit length in a medium with atomic number Z is then given by 


dE wy im do 
dx z Tmin dT’ 


where N is the number of atoms per unit volume. We assume that Tmin is the ionisation en- 
ergy I. Show that Tmax © 2B?Y mec”. Hence show that the mass stopping power is 


1dE KZ in (2m) 


pdx maß? I 


where K = 42Nar2m,c?, re = e? / (4T£omec?), ma is the atomic molar mass of the absorber 
and N4 is Avagadro’s number. 


7. Calculation of the Thomson cross-section. 
a) Consider a transverse electromagnetic wave in vacuum with the electric field propagating in 
the z direction, E = Ey exp{i(kz — @r)|&. Determine the magnetic field B. 
b) Calculate the Poynting vector. 


c) Calculate the acceleration of an electron in this field. Use Larmor’s formula to determine the 
emitted radiation field. Determine the Poynting vector for the radiated field. 


d) Combining the results in part b) and c) determine the differential cross-section for Thomson 
scattering and finally integrate this to get the total cross-section. Check your answer with 
eq. (2.9). 


8. Cherenkov light in the eye. 


Some people claim that muons from cosmic radiation can be observed through the Cherenkov radi- 
ation in the human eye. Comment on the plausibility of this statement. (The human eye will trigger 
a conscious response to bursts of light with about 10 photons over 100 ms.) 


After you have found your answer, look at [364]. 
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9. Rapidity and pseudorapidity. 
Show that for energies E such that E >> m, where m is the mass, the rapidity y is equal to the pseu- 
dorapidity n. 
10. Nuclear interaction lengths. 
Estimate the hadronic interaction lengths using eq. (2.57) and compare with the values given in 
Table 2.2. 
11. Neutrino interaction cross-section. 


Consider a neutrino experiment with 1 MeV neutrinos. Estimate the cross-section on protons. The 
target consists of a cubic box of water with sides of 1 m. Approximately how many neutrinos will 
have to hit the target in order to have 1000 interactions on a free proton (i.e. a hydrogen atom)? 
Clearly, neutrino experiments will need a combination of intense sources and large detectors. 


[Gp/(hc)? = 1.1 x 1075 GeV ~”, the proton mass mp = 1.7 x 1071? kg = 0.939 GeV/c”, and you 
can use fic = 0.2 GeVfm.] 


‘3 Electronic signals and noise 


In chapter 2 we considered the different ways in which high energy parti- 
cles deposit their energy in matter. Now we will consider how to use the 
results of these interactions to create a measurable signal in a detector. 

One approach is to use the ionisation created in the detector material. 
We will consider the alternative approach of detecting excitations in chap- 
ter 5. 

As we have seen, the typical energy loss for a minimum-ionising 
charged particle is about 2 MeVcm?/g, or typically a few 100 electron/ion 
pairs per cm of gas or a few 10000 electron/hole pairs in a 500 um thick 
semiconductor. If these charges flow over a time of a few 100 ns this re- 
sults in currents of nA and below. It is the challenge of electronic parti- 
cle detectors to make such a small signal detectable among the electronic 
noise that is intrinsic to any electronic readout system. 

Whether the signal will be visible above the noise will depend not on 
the absolute magnitude of the signal (S) or the noise (N) but the signal- 
to-noise ratio (S/N). Clearly, if S/N < 1 it will be impossible to separate 
the signal from the background and conversely, if S/N >> 1 then it will be 
very easy to separate the signal from background. The required value of 
S/N will depend on the requirements of a particular system but we can get 
a first feeling for the required value in exercise 1. 

An alternative parameter describing the noise performance of a 
charge-processing readout system is the equivalent noise charge (ENC). 
The ENC is defined as the charge at the input which would give the same 
magnitude charge signal as the noise. The ENC is usually give in number 
of electrons but it is sometimes quoted in units of fC. 

To understand the performance of electronic particle detectors it is 
therefore necessary to consider the process of signal generation and noise 
theory. In this chapter, we will first discuss how electronic signals are 
generated and how they can be calculated, followed by a short introduc- 
tion into noise theory. We will then discuss how electronic filtering can 
be employed to optimise S/N. In order to understand the actual values of 
S/N that can be achieved, we will need to consider the noise in transis- 
tors. We will then be able to combine the results to discuss overall system 
optimisation. 


3.1 ELECTRICAL SIGNAL GENERATION 


When electric charges are close to grounded conductors, they induce an 
electric charge on the surface of the conductor. The magnitude of the in- 
duced charge depends on the distance to the conductor. When the charge 
moves, the induced charge densities on the conductor will change, causing 
a current to flow. 

We can use the Ramo-Shockley theorem [455, 418] to calculate the 
induced current. To derive this theorem we consider a system of several 
grounded electrodes and a mobile charge g. Between the conductors and 
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This ionisation charge could be elec- 
trons and positive ions in a liquid or gas 
or electrons and holes in a semiconduc- 
tor. 


As will be discussed later, this charge 
can be amplified inside the detector 
using very large electric fields, but not 
usually by an amount that would change 
our conclusion here. 


Sometimes also the acronym SNR is 
used. 


It is the charge signal which would 
result in an S/Nof 1. 


This is demonstrated for a very simple 
case in exercise 3. 


This derivation follows very closely that 
given in [464]. 
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the mobile charge there is no charge. Therefore, the potential V satisfies 
Laplace’s equation there, V?V = 0. 

Next, we will consider a second configuration, where we remove the 
mobile charge and put 1 V on one conductor (C4) resulting in a charge Q; 
on this conductor. Again, between the conductors there is no charge, so the 
new potential V; also satisfies Laplace’s equation there, V*V, = 0. 

Consequently, the superposition V; V7V — VVV, also equals 0 be- 
tween the conductors. Applying Green’s second identity to the volume 
between the conductors and a sphere surrounding the moving charge gives 


J (ev —ven) av= ¢ (vivv vom) dž =0, 


where the closed surface integral covers the surfaces of the conductors and 
the sphere around the moving charge. 
We can split the right hand side integral into three parts: 


1. The integral over all conductor surfaces except Cı vanishes because 
V = Vi = 0 on these conductors. 


2. On the surface of C1, Vı (C1) = 1 V, and V = 0 and using Gauss’s 
law we get a contribution Vı (C1) x $ VV - ds’ = Q1Vı (C1)/£0. 


3. On the surface of the sphere the second term in the integral vanishes 
because there is no charge enclosed by the sphere for V;. Applying 
Gauss’s law to the first term and reducing the radius of the sphere 
to 0 gives shore V1 WV -d5 > qv (7)/€ where 7, is the position of 
the mobile charge. 


Combining these results, we obtain for the charge induced on the conduc- 
tor C1 


= Vi (7) — = 
Qı ACA = qV (F4), 


where we have introduced a weighting potential Vw, with dimensions of 
inverse length. 

It is impractical to measure a static charge, but if the charge moves 
with a velocity Y, the induced current on C4 is 
dQ; dVw 


a Ta 


i qÝ Vw: F(t) = —qEw V(t), (3.1) 
with the weighting field Ey = —Wy,. This is the Ramo-Shockley theorem. 
The Ramo-Shockley theorem is derived under the assumption that 
magnetic effects are negligible and the electric field propagates instan- 
taneously. In addition we assume that the charges flow freely in a non- 

conducting medium such as a gas. 

The weighting field has dimensions of inverse length. It generally has 
no relation to the actual electrical field. It depends only on the geometry 
of the electrodes and determines how the motion of the charge couples 
to a specific electrode to induce a signal. It is obtained by grounding all 
electrodes in the system apart from the one on which the induced current 
is measured, and placing that electrode at unit potential. Only for a system 
of two conductors will the weighting field have the same form as the real 
electric field. 
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Numerically, Vy (F4) = Vi(Fq), but 
without the unit, as we have defined Vi 
on the surface of the conductor to be 
lV. 

If only the total charge induced on C; is 
required, it can be found from 


i XF _, 
Qi = fia=-4f Ey -%= 
ži 


= q [Vw (i) — Vw (xe)] - 


where xi and xç are the starting and the 
final positions of the moving charge. 


The effects of charges flowing in a 
conducting medium are examined 

in [423, 424] and a derivation which 
allows for relativistic effects is given 
in [426]. 
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To get an induced current, we need to make the free charges move. In 
electronic particle detectors we use electrical drift fields in the detector to, 
first, separate the ionisation charges created by the interaction of the in- 
coming particle with the detector matter, and second, to create the motion 
of the charges required to induce a current on the readout electrodes. The 
current is induced from the moment the ionisation charge is created and 
starts to move, for as long as the charge is moving, and it ends when the 
movement of the charge stops (typically when it reaches the oppositely 
charged electrode). 

We will illustrate the Ramo-Shockley theorem here for a simple paral- 
lel plate capacitor and a single electron-ion pair created in its gap. 


I 
I 
z=D 
E We i So Z=Z, Qa 
YEV- z q 
z=0 
L 


For this geometry the electric field is E = —Vo/D. The weighting field 
is EL = —(1/D)z for the upper electrode, while it is E2 = (1/D) for the 
lower electrode. For simplicity we will assume here that the drift velocity 
for the charges is proportional to the drift field (v = WE, with the mobil- 
ity u) and thus constant in this geometry. Therefore the induced currents 
on the two electrodes are 


from electron from ion 


b(t) =-h(t). 


Q= k Idt, and thus the total charge induced on the upper plate is 


° D= zo Z0 
tot — d= q t His q T 
Qi [ 1 p (rele + V+ 4) D Ve Ve a ee q 
This discussion illustrates the use of the Shockley-Ramo theorem for the 
simplest possible case. We will use it for more complicated geometries in 
later chapters. 


3.2 ELECTRONIC NOISE 


Electronic noise is a fluctuation of the output of the electronic readout 
chain of a detector that is not caused by the input signal of interest. All 
components in the readout chain contribute to noise. 

Electrical current describes the motion of charge carriers. In a metallic 
conductor these are typically electrons, but in other media other charge 
carriers can contribute as well, for example, holes in semiconductors. If 
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In silicon detectors where speed is not 
a concern (for example some mobile 
phone cameras) diffusion of charge 
carriers can be sufficient. 


Sometimes in the literature the term 
‘charge collection’ is used, slightly 
carelessly. It needs to be understood 
that the process of collection happens 
during the approach of the charge 
towards the electrode, and it ends 
with the arrival of the charge on the 
electrode. 


Figure 3.1: Ionisation in a parallel 
plate capacitor. It is assumed that the 
capacitor has a width and length that 
are much greater than the thickness so 
we can ignore the effects of fringe fields. 


We will study the drift of charge carriers 
in gases in section 4.1, in liquids in 
section 4.2 and in semiconductors in 
section 4.3 in more detail. 


The readout chain comprises the de- 
tector, and subsequent analogue and 
digital electronics. 
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the charges move between two electrodes separated by a distance / the 
current is given by 


(3.2) 


where we are summing over the charge carriers, which have individual 
speeds vj. 

In this simple equation there will be two possible sources of fluctua- 
tions of the current. One is the fluctuation in the number of charge carriers 
and the other are the fluctuations in the speeds. Assuming these two ef- 
fects are uncorrelated, we can write the fluctuation of the current (i.e. the 


noise) as T- Ci ca 


Fluctuations in the number of charge carriers give rise to ‘shot noise’. 
Fluctuations in the speeds can arise from thermal motion and hence this 
type of fluctuation is called ‘thermal noise’. We call these types of noise 
‘random noise’ as the origin of these fluctuations is stochastic. A third 
contribution to random noise, which is not covered by the simple picture 
of eq. (3.2), is ‘1/f? noise. 

If we compare eq. (3.2) with the Ramo-Shockley theorem, eq. (3.1) 
it is obvious that similar fluctuations will be created in the detector itself. 
In addition, stochastic processes in the generation of the charge from the 
detector, some of which we have already discussed in chapter 2 (e.g. strag- 
gling distributions), can create very significant signal fluctuations. 

The above are the fundamental noise sources in our detectors and asso- 
ciated amplifiers but there can in addition be noise from external sources. 
Finally, in modern particle detectors the output of the readout chain is usu- 
ally digitised, which introduces additional deviations from the ideal signal 
called ‘digitisation noise’. 

In the following we will study each of these contributions in more de- 
tail. 


(3.3) 


Shot Noise 


Shot noise arises from the random fluctuations in the emission of charge 
carriers. Shot noise is usually small, but it can be a major noise source for 
small currents, low temperatures and high frequencies. 

If we consider the flow of electrons between two electrodes, for exam- 
ple in a pn junction diode (This can be a silicon detector, or a semicon- 
ductor component in an electric circuit), the emission of charge carriers 
across the junction will be uncorrelated. Generally, each emission event 
will create a signal h(t) with some finite time dependence. (For example 
h(t) = (U(t)/C)exp(—t/t), with U (t) a unit step function, and T a time 
constant given by complex impedances in the circuit.) and the combined 
noise signal is given by the superposition of these. To calculate the noise 
variance we use Campbell’s theorem [180], which states that this variance 
is given by the sum of the mean square contributions of all impulses pre- 
ceding in time. 
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Thermal noise is also sometimes re- 
ferred to as Johnson noise, after its 
discoverer. 


Because of the different origin of this 
noise (stochastic fluctuation in the 
ionisation charge generation) it is 
practical to treat this separately (‘signal 
noise’). 


A current of 1 A comprises 6.25 x 10'8 
electrons per second. Even fluctuations 
of several billion electrons per second 

will not have a large effect. 


The stochastic nature of the charge 
movement is important. For example, 
in a regular (i.e. ‘ohmic’) resistor, the 
movement of individual electrons is 
correlated, and hence shot noise is 
negligible. 


We are following here the derivation 
given in [413]. 
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We denote the contribution at time tọ from an impulse occurring at 
time t; as V;(to) = qeh(to — ti) = qeh(u), and thus 
(na? f Hudu, 


= eh (to — ti) 
where n is the rate of impulses. The transition to continuous variables in 
the last part is justified in the case of high rates, (n)t >> 1, which is the 
case for macroscopic currents. 
In the frequency domain the system response is given by the transfer 
function H(@), which is the Fourier transform of h(t). We can now use 
Parseval’s theorem, 


1) pS 2 1 7°” 2 
h? H(© do=— | H()|* da, 
f ? a= sf aK) T JO A ()| 


of = 2in)ae [HCP af. 


If we assume that this variance is the result of an input spectral density i2, 
which can be associated with a variance 


= | REAP 


we find by comparison for the spectral density of the shot noise 


*(to)) — w(t 


o? = (v 


to obtain 


df, 


È = 2lqe, 


using I = (n)qe. 

The spectral density for shot noise is independent of the frequency. 
Numerically, the power density for shot noise dP/df has a magnitude of 
3.2 x 107!9 W/AHz. As the noise power scales with J, the noise current i, 
scales with V7 , and therefore the relative noise current i,,/J is proportional 
to 1/ vI, and increases for smaller currents. 


Thermal Noise 


Thermal noise or Johnson-Nyquist noise is the result of variations of the 
velocities of the charge carriers and can be seen as the effect of Brownian 
motion of charge carriers. 

In a conductor of length / the average current due to the movement of 
one electron is given by Z(t) = qv(t)/l. The autocorrelation function for 
the current is then given by 


We will assume that electrons do not interact with each other, but will 
scatter with scattering centres in the band gap, with a probability given 
by exp(—t/T). The velocity will then de-correlate from its previous value 


at a similar rate, 
To v(t +r) = ye ll/t, 


(3.4) 
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Figure 3.2: Shot noise in the time 
domain. 


It is common to denote the spectral 
densities 5 
g = dvioise/df, 


i 


stb sty 


and we will adopt this convention, even 
though it leads to ambiguity for the 
current. 

In calculations, e, and i, can be used 
as voltages and currents at a given 
frequency. 


Noise with a uniform power spectral 
density is called ‘white’ noise. 


For a comprehensive discussion of 
thermal noise, including historical 
context, and a discussion of thermal 
noise at high frequencies and low 
temperatures, see [7]. 


The derivation here follows [74]. 
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Next, we will employ the equipartition theorem, which states that in equi- 
librium at a temperature T the energy stored in a degree of freedom is on 
average (1/2)kgT, hence for the kinetic energies of the moving charge 


carriers 

my? 1 
= —kpT. 
2° 2° 


Putting all this together we obtain 


2, 
(I(t)I(t-+1')) = TET Wie 
m 


For a continuous flow of a total number N = nAl electrons, where n is 
the electron density, and A the cross-section of the conducting channel, the 
autocorrelation function becomes 


Nq?kpT —|t|/t — nAq kgT —|t\/t 
=r R = ——e , 
ml? ml 


(ÐI +t) = 


Next, we will make use of the Wiener-Khinchin theorem [494, 305], 
which states that the power spectral density is given by the Fourier trans- 
form of the autocorrelation function. Here for the current this can be writ- 
ten as 

B =2F{ (I(t +t))}. 
In our case 


2T 
1+0? 


2_ 2nAqkeT >f yr) _ 2nAq?kpT 
a ml ie a ml 


In a typical conductor, T < 1 ps, whereas @ of interest is usually < 1 GHz. 
Hence w?t” < 1 and we can ignore this term. 

Finally, we use the Drude conductivity for a conductor, © = ng? T/m 
with the charge carrier density n, and replace oA// = 1/R to find 


n R 


(3.5) 


With P = U?/R = PR the noise current power density can also be ex- 
pressed as a noise power density dP/df = 4kgT or a noise voltage power 
density e2 = 4kg TR. The power spectra for thermal noise are also uniform 
in frequency and so are another example of white noise. At room tempera- 
ture dP/df ~ 1.7 x 107% W/Hz. 


1/f noise 
1/f noise or ‘flicker noise’ comprises a number of noise mechanisms that 
are not well understood, but display a similar spectral characteristic in that 
the power spectral density falls as 1/ f% with 0.5 < œ < 2. In electronics 
it is often caused by trapping of charge carriers or shows up as a resistance 
fluctuation. 

If the mean time constant for a trapping process is T, the resulting 
noise spectrum will have the form (see exercise 6) 


2 
2 Ao 


Ly = I+ (or)? (3.6) 
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We only are interested in the longitudi- 
nal degree of freedom here. 


An equivalent argument is that the 
current signals (and thus the autocor- 
relation function) are very short, hence 
can be described by 6-pulses, and the 
Fourier transform of a 6-function is 
uniform in frequency. 


Thermal noise can be reduced by lower- 
ing the temperature. Thermally induced 
signal fluctuation of the signals from the 
detector will also decrease for lower 
temperatures. Both are reasons, why 
ultra-low noise detectors operate at 
cryogenic, sub-K temperatures. The 
most advanced technologies are on the 
brink to become sensitive to quantum 
effects in the sensing and readout (see 
for example [501]). 


Such a spectrum is also referred to as 
‘pink’ spectrum. 


This type of noise is also called ‘random 
telegraph noise’ (RTN). 
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A combination of a few processes with different values of T can lead to an 
approximate 1/f spectrum (see exercise 6). 

The variance of the noise voltage due to 1/f noise does not scale with 
the bandwidth, but with the ratio of the high and low cut-off frequencies, 


Shi d ; 
o? oc af =In (=) : 
fio f fio 


This has the consequence that if the spectral response of the readout is 
limited by a simple bandpass filter (for which the ratio of high and low 
cut-off frequency is usually constant) the contribution from 1/f noise is 
usually independent of the centre frequency of the bandpass filter (which 
corresponds to the inverse of the time constant of the measurement). 

Unlike thermal or shot noise, 1/f noise is device technology depen- 
dent. For example, it is the major noise mechanism in MOSFETs at low 
frequency, where it is caused by defects in the surface layer of the conduc- 
tive channel. 


Interference noise 


The previous sections discussed random noise which originates in the 
physical process of charge conduction. As such, these types of noise are 
often referred to as irreducible. In addition, there can be large unwanted 
noise contributions caused by external sources that usually can be min- 
imised by proper design of the electronic system. 

Typical instances of interference noise are 


e External noise sources that couple an emf into your electronics by 
capacitive or inductive coupling; 


e Shared current paths on a common ground bus (‘ground loops’) that 
introduce shifts of the ground potential in a circuit. Ground loops 
are also vulnerable to stray AC fields, which will create an emf due 
to Faraday’s law; 


e Ripple on DC supply lines, often when the DC voltage has been 
generated from AC sources or by switching. 


Often, external sources generate noise at specific frequencies and the 
generated noise spectrum is non-uniform with limited bandwidth, and 
maxima at these frequencies (for example interference from power lines 
at line frequency). In multi-channel systems interference noise is often 
coherent. 


The good news is that interference noise can be prevented if the ac- 
tion of the external noise source on the system of interest is interrupted. 
Measures to prevent interference noise are 


e Shielding (Faraday cage): Shielding reduces external noise in 
two ways. First, it will reflect the incident electromagnetic wave. 
The reflected electric field is Erea = Eo(1 — Zshicia/Zo), with 
Zo = Ho/€o = 377 Q, the impedance of free space. The impedance 
of the shield Zpieiq 18 typically much lower than this, so most of 
the incoming wave is reflected. Secondly, shielding will attenu- 
ate the penetrating electric field. The absorbed wave gives rise to 
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A MOSFET is a widely used type of field 
effect transistor. The acronym stands for 
Metal Oxide Semiconductor Field Effect 
Transistor. 


Because of the correlation with external 
sources, this is appropriately called 
‘interference’, to distinguish it from the 
random noise discussed so far. 

emf or electromotive force is the electri- 
cal action produced by a non-electrical 
source (e.g. an external elecromag- 
netic field). Effectively, it is an induced 
voltage in the circuit. 


Coherent noise in a set of channels be- 
longing to a section of the readout (e.g. 
a chip or a module) is also sometimes 
called ‘common-mode’ noise. 
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a local current, which creates a field that counteracts the primary 
excitation. The net current decreases exponentially as the wave 
penetrates deeper into the medium, i(x) = ige~*/*, where 6 is the 
‘skin depth’, which, for frequencies much below (pe)~!, is given 
by ô = \/p/(afu) (In aluminium at f = 1 MHz the skin depth is 
about 85 um). Thus, in principle, shields can provide a very power- 
ful means to reduce interference, as long as holes in the shields with 
a size of A = c/f and larger, where f is the frequency of interest, 
are avoided. It is therefore important to shield all parts, including 
connector housings, and loss of conductivity across seams in the 
shield due to corrosion or insulating surface finish must be avoided. 


e Break ground loops: Avoiding ground loops has two benefits: it re- 
duces the risk of shared current paths and it removes possible in- 
ductive loops. The best topology for ground connections is a star 
configuration, where ground is distributed from one, well-grounded, 
central point. 


e Minimise the size of inductive loops: Route signal and return lines 
closely together. Twisted wires will further improve inductive pick- 
up as the field direction is reversed in consecutive twists. 


e Low-impedance ground connections: High conductivity ground con- 
nections will reduce voltage drops. Be careful about inductances, 
which can introduce serious impedances at frequencies of interest. 


e Ferrite sleeves or beads: These are made of materials that can be 
magnetised, but are not conductive. In close proximity to a conduc- 
tor, they will create a high impedance for high frequency signals, 
and thus create a low-pass filter. They are therefore useful for wires 
from power supplies for which we want to filter out high frequency 
noise over a broad frequency range. 


e Use differential signals and receivers with high common mode re- 
jection ratio (CMRR). Many external types of noise will generate 
similar currents/voltages in the two conductors, which then cancel 
in such a receiver. 


e Manage return current paths: Keep digital and analogue return paths 
separate. This is to minimise cross-talk from the large digital sig- 
nals into the small analogue signals. The most sensitive part of the 
circuit is typically the connection of the sensor/detector to the first 
amplifier (preamplifier). 


In the subsequent discussions in this chapter we assume that interfer- 


ence noise has been minimised by proper design. 


Digitisation noise 


It is standard practice to convert the output of modern particle detectors 
into digitised data. Major benefits of this format are the ease of high- 
volume storage, good replicability, and accessibility for processing in 
computer-based offline analysis. However, these benefits come at the cost 
of a reduction of information contained in the analogue output signal from 
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Here p is the resistivity, € = €€ is 
the permittivity, and u = Uy 4o is the 
permeability of the conductor. 


In a colliding beam experiment it is 
essential to provide good shielding from 
the large currents caused by the beams. 


Typical cut-off frequencies for ferrites 
are about 10 MHz. 
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the detector. The key challenge in designing the signal processing for a 
particle detector system is that this data reduction does not unduly reduce 
the ability of the detector to measure the properties of interest. 

A common feature of different digitisation processes is the mapping 
of a continuous variable onto a digital scale, which increments in discrete 
steps. This means that the output of the digitisation will differ from the 
analogue value, except in the centre of the interval. The distribution de- 
scribing the error has a square distribution and the average error can be 
quantified as the RMS of that distribution. 


RMS, = 4/ (t?) — (t)? = 
=} ar 1 fT/2 R 
FF a- (F fira) 
T/2 T J-r/2 
where T is one digitisation interval. 

It is obvious that a small digitisation step size reduces the digitisation 
error. However, a small digitisation step size requires a larger number of 
bits to cover the range to the largest signal to be digitised. The ratio be- 
tween the largest and the smallest signal that can be digitised is called 
the ‘dynamic range’ (DNR) and can be given in number of bits or in dB. 
The smallest measurable signal is 1 Least Significant Bit (LSB). In a lin- 
ear system the DNR is then given by the number of bits available for the 
digitisation. 

Often, a signal is not only digitised into a single number (like for ex- 
ample the peak voltage, or the charge in the signal), but it is sampled 
at regular intervals to obtain a digitised image of the signal in the time 
domain. Sampling again introduces a loss of information, this time the 
knowledge of the signal in between the sampled times. Consequently, sig- 
nals with a frequency above half the sampling frequency will appear to 
have an unphysical frequency (‘aliasing’). 

The frequency distribution of the amplitude digitisation noise is gener- 
ally not trivial. If the signal is uncorrelated with the sampling, it will have 
a constant spectral density between f = 0 and f = fy. However, if the sig- 
nal is periodic, the quantisation noise can become periodic, too. Dithering 
is a method to effectively randomise the phase, and equalise the spectral 
density. 

The obvious way to reduce digitisation errors is to increase the 
DNR and, in sampling readout systems, to increase the sampling fre- 
quency. However, there are often practical limits (size of data storage, data 
throughput rates, speed, cost, etc.) in this optimisation. 


T 
= —— ~ 0.297, 
v12 


3.3 AMPLIFICATION 


Due to the small amount of charge generated in a particle detector almost 
all electronic processing of particle detector signals requires electronic 

amplification of the signal. The detector usually provides us with a signal 
made up of charges, which, if their arrival is staggered in time, can be de- 
scribed as a current. However, we prefer as output of the analogue signal 


(3.7) 
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This is an error that does not follow a 
Gaussian distribution. This needs to 
be taken in consideration if the error 
needs to be combined with other error 
contributions. 


Output 


Input 


«— > Output- Input 


Figure 3.3: Ideal digitisation charac- 
teristics (top) and digitisation error 
distribution (bottom). 


In dB, 
DNR = 20log;0 x 
x (2”bits — 1) dB œ 6.027pits dB. 


We call fy = fs/2, where fy is the 
sampling frequency, the Nyquist fre- 
quency. The reconstructed frequency 
for a signal frequency fo > fy is 


fo = fu — (fo — fx) =fs— fo 


In dithering a small amount of random 
noise is added to the signal. 
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processing a voltage signal. We therefore connect our particle detectors to 
an electronic amplifier, which amplifies input charge or current to a volt- 
age output. 

As we have seen, we typically require electric fields to collect the ion- 
isation charges created by the interaction of the incoming particle in the 
detector material. We can therefore describe the detector as a capacitive 
source, made up of an ideal current source is, with a parallel detector ca- 
pacitance Cy. Here, we will assume that the current is constant, and flow- 
ing during the charge collection time f,. If we connect this source to an 
amplifier with input resistance Rin, we can distinguish two limiting cases. 
If RinCa > tc, the charge on the detector discharges slowly through the am- 
plifier input resistance, and Vout = Voexp(—t/(RinCa)), with Vo = Qs/Ca 
and Q, = f i, dt. In this case the amplifier input is given by the voltage cre- 
ated by the charge across the detector capacitance. On the other hand, if 
RinCa < tc, then the capacitor discharges quickly, and the output voltage is 
proportional to is. 

Today by far the most widely used electronic amplification devices 
are transistors. These are three-terminal components, where a charge (or 
a current) on the centre terminal controls the flow of current between the 
outer terminals. Two major types exist. Bipolar junction transistors (BJTs) 
consist of two back-to-back diodes that are connected to a common thin 
region called the base. The emitter-base junction is forward biased and 
the collector-base junction is reverse biased. A current injected into the 
base, the terminal connecting to the doped area between the two junctions, 
allows for a current to flow through the diode junctions, even though one 
of them is in reverse bias. BJTs are designed so that a small base current 
controls a large current between the collector and the emitter. 

The second type of transistors are field effect transistors (FETs), in 
which the width of a conducting channel between the outer terminals (here 
called the drain and source) is controlled by a charge close by, which is 
brought to the transistor on the centre terminal (here the gate). In an FET 
the gate is insulated from the drain-source current flow, so that very little 
or no current is flowing into it. There are two types of FETs: In junction 
FETs (JFETs) the gate isolation is achieved by a reverse-biased diode 
junction, whereas in metal oxide semiconductor FETs (MOSFETs) a 
thin isolator, typically SiOz, isolates the gate. Complementary MOS, or 
CMOS, is a widely available industrial fabrication process that uses com- 
plementary and symmetrical pairs of p-type and n-type MOSFETs for 
logic functions, but is increasingly used for analogue applications due to 
the small feature size available and its affordability. 

While these devices provide the amplification we need, a number 
of performance shortcomings (non-linearity, non-ideal input and output 
impedance, the need to provide a very stable bias voltage to achieve a de- 
fined gain, and a current instead of a voltage output) make these devices 
poor stand-alone amplifiers. Practically, we therefore embed them in more 
complex amplification circuits, of which the most powerful is the feed- 
back amplifier in which the gain is stabilised by feeding back part of the 
output signal to the input. As the fraction of the signal fed back is usually 
controlled by passive components, it is these components that control the 
gain, and not the more complex transistor components. 
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An ideal voltage generator has negligi- 
ble output impedance. 


b- o o 


Figure 3.4: Capacitive source and 
amplifier. 


An alternative description that is often 
used is that a small change in base- 
emitter voltage causes a large change 
in current, i.e. the device has a large 
transconductance, gm. 


The MOSFET is the most copious de- 
vice produced in history: An estimated 
10° MOSFET transistors have been 
produced in the past 60 years, mostly in 
digital ICs. 


CMOS gates have the very important 
advantage compared to BJTs that the 
power consumption is very low when 
the gate is not switching. 
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For the amplification of particle detector signals the most useful feed- 
back amplifier circuit is the charge-sensitive feedback amplifier. 


We assume an ideal inverting amplifier with infinite input impedance 
and a large ‘open-loop’ gain —A, so that Vout = —AVvin and consequently 
the voltage across Cy is vf = (A + 1)vin. Hence, the charge induced on Cf is 
Of = Creve = Cy(A + 1) vin. Because of the infinite amplifier input impedance 
Qf = Qin, and thus the input capacitance of the circuit Cin = Qin/Vin = 
C(A +1). 

The charge gain of the amplifier is the output voltage for a given input 
charge, and is given by 


Avin All 
A+1Ce Cr 


Vout 
Qin 


where the last approximation holds for A > 1. 
The signal charge Q, is divided by the capacitive divider Ca and Cin, so 
that the fraction of the charge supplied to the amplifier is 
Qin Cin 
Qs CatCn 


Ag (3.8) 


CinYin 


To effectively couple the charge into the amplifier Cin should be much 
larger than Ca. In that case the input charge to the amplifier is independent 
of Ca: 

For practical purposes a discharge path for the feedback capacitor Ct is 
usually required. This can be achieved by a parallel resistor Rẹ or switches. 


CASCODE AMPLIFIERS 


To achieve the desired signal gain before further signal processing mul- 
tiple gain stages are needed. In order to minimise the effects of noise in 
the second or subsequent stages, we require a large gain in the first stage 
(usually called the ‘pre-amplifier’). 

The gain of a single transistor common source FET amplifier is given 
by (see exercise 4) 


an EmRL 


A= , 


(3.9) 


where ro is the small signal output resistance of the transistor, 

ro = dVq/dlgs. Typically, ro is between 100 kQ and 1 MQ, and gener- 
ally ro > R. The output impedance of the common source amplifier is 
ro||RL & RL. To increase the gain we need to maximise the transconduc- 
tance gm, and ro. 
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Also known as the ‘charge-integrating 
amplifier’. 


Figure 3.5: Conceptual charge-sensitive 
feedback amplifier. 

A simple implementation uses op-amps, 
but similar circuits requiring fewer 
transistors exist (e.g. the complementary 
cascode, see for example [414]). 


R; will discharge the capacitor with a 
time constant of T = RyC}. 


-——_»—0 A 
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V; Vout 


Figure 3.6: Circuit diagram for a single 
FET common source amplifier. The 
voltage divider Rı/R, sets the gate 
voltage. This is called a common source 
amplifier, as the source is connected to 
both input and output. 


We will see in section 3.6 that we want a 
large value of gm to optimise the S/N. 
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An improved amplifier circuit to achieve a large output impedance 
Fo is the ‘cascode’ configuration, comprising two FETs. A cascode con- 
sists of a grounded common source FET amplifier (the lower FET in Fig- 
ure 3.7), which is loaded with a common gate amplifier (the upper FET in 
Figure 3.7). 

The output impedance of the cascode is given by (see [464] and exer- 
cise 4) 


Fo = Fo,1 +102 + Fo,1 Fo,2 8&m,2, (3.10) 


where ro,1 and ro, are the output resistances of the two transistors, respec- 
tively, and gm2 is the transconductance of the second transistor. Therefore 
the cascode circuit can have the same input impedance as the single tran- 
sistor but a significantly higher output impedance and thus increase the 
gain. 


3.4 NOISE IN DETECTOR AND AMPLIFIER SYSTEM 


We can now combine the results from the previous sections to consider the 
noise of the combined system of a detector and electronic amplifiers. 

First, we will show that the S/N in an amplifier chain is dominated by 
the noise from the first amplifier in the chain. If you have a chain of am- 
plifiers, each with a gain A; and an associated input noise Nj, amplifying 
a signal S, then in the first stage both the signal and the first stage input 
noise get amplified by the same gain and the output signal-to-noise ratio at 
the output of the first stage is S/N. After the second stage 


S (SA\As)2 S 1 


Mo \ MAANA) Mal ES 
NA} 


If the gain of the first stage is reasonably high, the square root will tend 
to | and the signal-to-noise ratio after the second stage is the same as for 
the first stage. In an amplifier chain the signal-to-noise ratio is therefore 
dominated by the first stage (preamplifier), and we will in the following 
concentrate on this. 

A series resistor between the detector and the preamplifier will gener- 
ate thermal noise as discussed in section 3.2. This will appear as a voltage 
source in series with the preamplifier and is therefore called series noise. 
Similarly, internal noise in the preamplifier, like for example from the in- 
put transistor in the preamplifier, will contribute to the series noise. 
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Figure 3.7: Circuit diagram for a 
cascode FET amplifier. The circuit can 
be further improved by replacing the 
load resistor Ri, with a current source. 


For a voltage output the output 
impedance is ideally small. This can 
be achieved using a subsequent buffer 
amplifier, for example an emitter (BJT) 
or source (FET) follower. 


Another benefit of the cascode is that 

it has a smaller input capacitance as it 
avoids the Miller effect. Thus it has a 
high bandwidth (see for example [289]). 


N, N, 
oD 2> S - 


Figure 3.8: Noise in an amplifier chain. 


se 
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Let the total series noise have a spectral density en. We will now con- 
sider the combined circuit of the detector and a charge integrating ampli- 
fier as introduced in section 3.3. We model the amplifier as ideal (i.e. no 
noise) because we have included the amplifier noise in ep. 


n,out 


Taking the preamplifier to be an ideal op-amp, no current flows into 
the inverting input, so that the impedances of the two capacitors constitute 
a voltage divider for en,out. Using the capacitor impedance Zc = (@C JL 


€nout _ (@Cq)~! +(@C;)"! j4 Ca 
= =14 ; 
enc (@Cq)~! Cr 
enc = Cn + nin and en,out = —A@n,in, and thus 


—o (14141 T! 
en = en AAG ~ en, 


where the approximation is valid for A > 1. In other words, the noise on 
the voltage across the capacitor is given by the series noise. 

Using the charge gain Ag of the feedback circuit (eq. (3.8)), the equiv- 
alent input noise charge is 


en out y Â 
Qnin = crow = €n,out V AfCt eny Af (Ca +Ct), 


where Af is the bandwidth of the readout system, over which the noise 
is integrated. It is obvious that to reduce the noise charge Cp should be 
small, and ideally we can ignore it compared to Ca. In that case the equiv- 
alent noise charge is proportional to the detector capacitance. As seen in 
the previous section the signal charge is independent of Cy, and thus the 
charge S/N is proportional to 1/Cg. The exact result can be different for 
other preamplifier configurations but the resulting signal to noise always 
shows this dependency. 

But the series noise from a series resistor and from the amplifier are 
not the only noise sources for the readout of a particle detector. We can 
add several other noise sources. 


R 


Ss 


1 C, 


First, we can add ‘parallel noise’ from current sources that are parallel 
to the detector capacitance. Two major sources for this type of noise can 
be present: amplifier parallel noise (with a current power density Rap) and, 
in particular in semiconductors, shot noise due to sensor leakage current 


(3.11) 
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Figure 3.9: Detector and preamplifier 
circuit model with series noise. 


That result also was valid for A> 1. 


Note that the capacitor itself is noise- 
less, the result is a consequence of the 
capacitive divider circuit. 

This clearly gives one motivation for 
using small areas for the sensitive 
elements in particle detectors, in par- 
ticular semiconductor detectors. Other 
compelling reasons are the desire to 
improve spatial precision and reduce 
the rate of double hits in one element. 


Figure 3.10: Detector and preamplifier 
circuit (left) and a more complete noise 
model (right). 


This is shot noise with a power spec- 
trum as described by eq. (3.4). 


58 


Teak. We assume that the input impedance of the (ideal) amplifier and any 
other parallel resistances are infinite, and thus the only impedance through 
which the current can flow is the detector capacitance, generating a volt- 
age in /(@Cq). Hence, the noise voltage power density at the amplifier in- 
put due to this current noise is 


5 i 2qel 


me (OC (0a) 

Finally, we can also add a contribution from 1/f noise in the input 
transistor of the amplifier, ei I7 Aif /f, where A; /f 18 an appropriate 
proportionality constant. 

We can assume that the different noise sources are uncorrelated, so we 
can add them in quadrature to get the noise spectral density at the pream- 
plifier input 


2 


ees 2 2 2 pe 
en.in — &n,s + en,as + en, /f ren] + en ap = 


1 
= 4kpT Rs + ef as +2HA1/p—+ (2qeh +i ap) az (3-12) 
— U VW GA (Q) 
es B 
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where we have collected different contributions according to their depen- 
dency on the frequency. 


3.5 PULSE SHAPING AND SHAPING TIME OPTIMISATION 


We have seen that the spectral density of random noise is difficult to re- 
duce. However, the total noise is given by the product of the spectral den- 
sity and the bandwidth, and hence the most efficient way to reduce the 
effect of random noise is to reduce the bandwidth of the readout system. 

The signal processing tool to limit the bandwidth at the low and the 
high frequency end is a bandpass filter. Many implementations of ana- 
logue bandpass filters exist, but for the discussion here we will focus on 
the conceptually simplest implementation, the buffered RC-CR filter, 
which comprises a first order low-pass and a first order high-pass filter, 
each with a cut-off frequency @ = (RC)~', separated by a unity gain 
buffer amplifier. The purpose of the buffer amplifier, which we assume 
has an infinite input impedance, is to prevent the output of the first stage 
‘loading’ the second stage. While in principle we could use different val- 
ues of R and C in the two stages, the choice of the same values simplifies 
the calculations, and a more general analysis shows that this choice is also 
optimal for maximising the S/N. 


in out 


A useful tool to analyse analogue filters is the Laplace transform. To 
find the response of a filter in the time domain, we need to find the trans- 
fer function of the filter, which is defined by the ratio of output to input 
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Additional components in a real detec- 
tor circuit might need to be added, but 
by now it should be clear how this can 
be done. Ca, for example, will include 
other parallel capacitances, etc. 


It is interesting to note that the ca- 
pacitance Cy converts the white noise 
spectrum of the parallel shot noise into 
an effective 1/ f? spectrum. 


Remember that for the charge integrat- 
ing feedback amplifier with the usual 
assumptions (A > 1 and Cy > Ct), 

2 22 
Cnour = (Ca/Ct) enin 


Obviously this needs to be done in a 
way that does not reduce the signal 
spectrum more than the noise spectrum. 


More complex bandpass filters can 
achieve slightly better S/N, but the 
basic discussion is the same. 


Often the low-pass and the high-pass 
filter are labelled as integrator and 
differentiator, although this is strictly 
speaking not correct, as these circuits 
would only have the frequency char- 
acteristics of an actual integrator or 
differentiator if R > 0. 


Figure 3.11: Simple RC-CR bandpass 
filter with unity gain buffer amplifier. 
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voltage as a function of complex frequency s. We then multiply it with the 
Laplace transform of the signal, and then calculate the inverse transform to 
obtain the result in the time domain. 

The transfer function for this RC-CR filter is given by the product of 
the transfer functions for the two parts, Hrc(s) x Hcr(s). These transfer 
functions can be found using the voltage divider rule, 


H(s) = HArc(s) x Her(s) = 
er R 
“REGO R46C) 


sRC 
= . l 
(1 +sRC)? one) 


Next, we need the Laplace transform of the signal. As we have seen, 
the exact shape of the signal depends on the charge collection in the de- 
tector, and the time constant of the preamplifier. However, it is common 
to consider the response of the filter to a step function. The Laplace trans- 
form for a step function is of the form 1 /s. If the total charge of the pulse 
is q we can model the voltage output of the preamplifier as q/(C;s). 

Combining this input function with the transfer function we can write 
down the output for the signal pulse as 


qRC 


aa )= 
Cr(1 +sRC)2" 


out (3.14) 
We now use the inverse Laplace transform to determine the output 
pulse in the time domain. This is discussed in exercise 2. The result is 


ignal qt 
signa (t) Ag e 


ot = Crt (3.15) 


where the time constant T = RC. 

We can differentiate this equation to see that the maximum of the pulse 
occurs at a time fax = T and therefore the maximum signal amplitude (in 
terms of charge) after the shaper is gexp(—1) ~ 0.368g. 

It is the modification of the signal shape in the time domain that leads 
to the designation ‘shaper’ for such a circuit in detector signal process- 
ing. In addition to limiting the bandwidth a shaper can also be useful for 
baseline restoration (i.e. resetting the output to zero after a suitable time). 

After studying the effect of the shaper on the signal we will now study 
its effect on the noise, which is given in eq. (3.12). We obtain the total 
RMS noise after the shaper from 


co m C 2 
2 2 2 d 2 
Vams = | en shaper CO = I enin (2) |H(@)| do, 
0 0 f 


where we are using the transfer function in the frequency domain, 


10T 
HO) Gene 


From comparison with eq. (3.12) we obtain 


T 


C2 
2 2 d 2 :2 
VRMS => 402 (ai +4Aı/fCa +a) ‘ (3.16) 
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The Laplace transform is a general- 
isation of the Fourier transform. It is 
a transformation into a complex fre- 
quency space 


F(s) = 21s} = f FO a, 


with s = œ +i@. a is sometimes called 
the Neper frequency and is a measure 
of how fast the transient response of the 
circuit will die away after the stimulus 
has been removed. It is related to the 
attenuation of a damped RLC circuit. 

@ is the standard angular frequency. 


Such a situation is not uncommon, for 
example, the current pulse in a silicon 
detector is very fast (typically lasting 
O(10 ns)) and if we integrate this with 
a charge sensitive amplifier with a large 
time constant RinCg, the input to the 
RC-CR filter will be a step function. 
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Figure 3.12: Transfer function of the 
buffered RC-CR filter in the frequency 
domain. 
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t/T 
Figure 3.13: Response of buffered 
RC-CR filter to a step function input in 
the time domain. 


Noise is a continuous phenomenon 
and thus we can drop the aperiodic 
time-dependent effects of the shaper. 


The required integrals are 
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We see that the contributions from the different noise sources have 
different dependencies on the shaping time constant T. The parallel noise 
contribution increases with shaping time. This is what we would expect 
because as the shaping time increases more charge from the leakage cur- 
rent accumulates, and we expect the Poisson fluctuations to increase lin- 
early with v/T. The series noise contributions scale with 1 /t because the 
series noise is uniform at all frequencies, and if tT decreases the absolute 
bandwidth increases and the noise is integrated over this larger bandwidth. 
Finally, the 1/f noise contribution does not depend on the shaping time, 
as we have seen before (see section 3.2). We can also see that the series 
and the 1 /f noise contributions increase for larger detector capacitance. 

We can combine the results so far to find the equivalent noise charge 
(ENC). Assuming that we sample the signal from one electron then after 
the shaper the signal peak is qe/(eCf). Combining this with the noise at 
the output of the shaper (eq. (3.16)) we find that the squared value of the 


ENC is given by 
2 C2 
2_ 8% f 2 ba 232 
(ENC)? = Ta (ee E FAC tint J). (3.17) 
Ye T 
a , 10* 
ene BED - Ss VOLTAGE NOISE CURRENT NOISE L ; 
maa WHE A ne 
Se 1 a Rp 
Gaseous detectors mae Parallel noise = 410° 4 L 
Si strip detectors === Series noise 1 O x 
SN C~10pPRe a 1/f noise o if 
“NAT Total noise 4 fe) 
~T z 
a 5 1 
` J fa 10-4- se ee 'na 
Ia F = ge R 
CCDs*~., ee ee s 
Si drift detector“ 4 3 i 
C, ~ 0.1 pF a 
1 10' | ee ere ee 
10° 1 10° 10° 0.01 0.1 1 10 
Shaping time [us] PEAKING TIME (us) 


Figure 3.14: ENC as a function of the shaping time. Left: For different detector systems (after [413]). Right: For a typical silicon detector 
(la = 10 nA, inp = 0.2 pA/V'Hz, enas = 5 nA/V/Hz, Ap = 1071! V?, Ry = 400 Q, Cy = 10 pF) [463]. 

If there are no other system constraints then we can find the optimal 

: : é Sais . Other considerations, like for example 

shaping time, Tin, by differentiating eq. (3.17) with respect to T (exer- the need for fast baseline restoration in 
cise 5). The result of this calculations is that the optimal shaping time is high-rate experiments, can affect this 

è . . . . optimisation. 
achieved when the series and parallel noise contributions are equal. 


3.6 TRANSISTOR NOISE 


So far our discussion of noise optimisation was based on an ideal ampli- 
fier, for which we described the amplifier noise performance using generic 
series and parallel noise contributions. We will now investigate in slightly 
more depth the noise contributions from real transistors. The noise sources 
in BJT and FET amplifiers are very different, so we will discuss them sep- 
arately. 


Bipolar junction transistors 


The base current J, in BJTs in general is comparatively large. This results 
in shot noise due to the base current with a spectral power density (using 
eq. (3.4)) 

ey = 2qelp. (3.18) 
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This noise current is in series with the noise from the detector leakage 
current so the two noise contributions add up for the parallel current noise. 

The shot noise due to the collector current can be referred back to an 
equivalent series voltage noise density at the input, ey in, using the stan- 
dard bipolar junction transistor relation ic = gmVp. It is given by 


ees 2(kpT )2 
2 -() sf = TT 


(3.19) 
dele dele 


The noise optimisation for bipolar junction transistors now has a dif- 
ferent outcome. If we increase J, we increase gm and therefore decrease 
the voltage noise (eq. (3.19)). However, this will imply an increase in the 
base current h, and hence an increase in the current noise (eq. (3.18)). 
The result is that we can maintain the same ENC as the shaping time T is 
changed, provided we adjust K. 


MOSFETs 


For MOSFETs the gate leakage currents are generally negligible so there 
is no contribution to the current noise. The conducting channel of the 
MOSFET acts like a resistor and therefore there will be thermal noise. The 
MOSFET channel is not uniform over it’s length and the calculation of 

the thermal noise is more complicated, but it will still act like a resistance 
R œ 1/gm. Therefore the spectral density of the resulting noise current can 
be written as 


in = Wngm4keT, (3.20) 


where % is a constant of the order of unity. 


It is convenient to refer this current noise power density to an equiva- 
lent voltage noise power density at the input of the transistor 


n,in 
; ia 


(3.21) 


For MOSFETs the variation of drain current with gate voltage is given 
approximately by 


la = k(Vgs — Ve)’, (3.22) 


where k is a constant which depends on the device geometry and the ca- 
pacitance, and Vr is the threshold voltage. Therefore the transconductance 
for a MOSFET is 


dla 


= —— = 3.23 
i (3.23) 


2m 2k(Vgs — Vr) = 2V/kla. 


Substituting this into eq. (3.21) we find 


2kgT 

2 B 

— i 3. 4 
e= ý; JE (3.24) 


3.7 NOISE SYSTEM OPTIMISATION 


Combining the results from the generic noise analysis and the noise in 
MOSFET and bipolar junction transistors we can now consider the overall 


61 


In the Ebers-Moll model of a BJT the 
transconductance is given by 
dk qe 


m= mM aTr” 


where the derivative and I, are eval- 
uated at the working point. At room 
temperature for a BJT a good approxi- 
mation is 8m ~ Ie / (25 mV). 


For JFETs the leakage currents are not 
negligible. 


The value of } = 2/3 is often used. 


This assumes that the MOSFET is 
operated in saturation (Vy. > Vr). If 
we are trying to optimise the S/Nat 
low power then it is advantageous to 
operate in a different mode. 


A very simple justification of this 
‘square-law’ form is given in [289]. 
An excellent full first principles physics 
derivation is given in [406]. This text 
clearly explains the approximations re- 
quired to derive this result and also in- 
cludes a full derivation of the variation 
of Iq with Ves when these simplifying 
approximations are relaxed. 
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system optimisation. This optimisation is different for the two transistor 
technologies. For BJTs, we found that we can decrease the voltage (series) 
noise by increasing the collector current (eq. (3.19)), but this increases 

the current (parallel) noise (eq. (3.18)). Therefore the optimal noise using 
a bipolar transistor can be found for any shaping time by adjusting the 
collector current. However, in practice this might result in an unacceptably 
large current and hence power consumption. 

The noise minimisation analysis is different for CMOS transistors 
because the current noise only arises from the detector shot noise. This 
means that the simple optimisation of eq. (3.17) considered in exercise 5 
is valid. From eq. (3.24) we can see that improved noise is obtained by op- 
erating at high currents, /g. Operating at higher currents obviously implies 
higher power consumption. 

Increased power consumption might not be an issue for a single chan- 
nel detector, but it is critical for large tracking detectors used in collider 
experiments. If the power consumption is larger, more metal (usually cop- 
per) conductors will be needed to transfer the current from the power sup- 
plies to the on-detector electronics. In addition, an increase in electrical 
power will require more substantial cooling circuits to remove the heat. 
All this will introduce more passive material in the detector, increasing 
multiple scattering for charged particles and hence degrading the resolu- 
tion for momentum and impact parameters. The additional material will 
also cause more electron bremsstrahlung impairing the measurement of 
electrons in the electromagnetic calorimeters. 

In section 3.5 we saw that we can select the optimal shaping time Tmin 
to obtain the minimum ENC. Such an optimisation would be acceptable 
for a low-rate detector but not for a detector in a high-rate environment. 
For example, at the LHC, we have collisions every 25 ns. The extra colli- 
sions that occur in one bunch crossing on top of the interesting triggered 
events create background particles (‘pile-up’). Therefore to minimise pile- 
up at the LHC we need to keep the shaping time less than about 25 ns. 

All these constraints need to be very carefully considered in the overall 
detector optimisation. Faster pulse shaping to minimise pile-up leads to 
an increase in noise. For CMOS transistors, the noise can be reduced by 
increasing J4, but this increases the power consumption and requires an 
increase in passive material. As common in detector physics there is no 
simple optimisation and the physics requirements for particular detectors 
will lead to different solutions. 


3.8 DIGITISATION 


The original output of any particle detector will be an analogue signal gen- 
erated by the passing particle, possibly amplified internally and by a sub- 
sequent pre-amplifier. For processing of this data, relevant features need to 
be extracted and converted into digital information. These features can be 
the presence of a signal (analogue-to-binary conversion), the measurement 
of the arrival time of the signal, the magnitude of a signal (analogue-to- 
digital conversion), or its complete time structure (sampling). 
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For the LHC phase 1 silicon detec- 
tors, there was an advantage in terms 
of noise at low power using bipolar 
junction transistors for the front-end 
amplifiers. On the other hand, the 
greater density available with CMOS 
technology was required for the digital 
circuitry. Therefore mixed technology 
BiCMOS processes were used in some 
applications. With the improvement in 
CMOS technology this advantage is less 
compelling and pure CMOS technology 
is being used for the ATLAS and CMS 
trackers for the HL-LHC upgrade. 
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ANALOGUE-TO-BINARY CONVERSION 


In the simplest case the information required from a particle detector is a 
binary hit/no-hit decision, for example, if the information is to be used in a 
coincidence in an experiment trigger (see chapter 13). The binary signal is 
obtained as the result of the comparison with a threshold. 

In general, the output of the detector will also carry noise and non- 
physics signals, for example from thermal excitations in the detector. 
These will also fire the comparator if they are above threshold. The first 
task will therefore be to select a suitable digitisation threshold that max- 
imises the number of detected real signals, while rejecting most of the fake 
ones. Both signal and backgrounds will increase as the threshold is low- 
ered. For a working detector the efficiency for the signals will be high for 
thresholds, where the rate of accepted background events is still small. 

As the rate of real signals is defined by external factors the accepted rate 
is constant once the comparison is fully efficient, and will only increase 
again when significant amounts of background are accepted. The graph 
of rate of accepted hits shows a ‘plateau’, and the threshold is usually set 
within the plateau. A better way to set the threshold is if we can identify 
signal and noise events (for example using other detectors). The threshold 
can then be optimised to obtain a high efficiency with low noise rates. 


ELECTRONIC TIME MEASUREMENTS 


While in many cases it is the magnitude of an electronic signal that is re- 
quired from a detector, there are instances where we want to know the 
time interval between a start and a stop signal. The electronic device for 
this purpose is a time-to-digital converter (TDC). First, the start and stop 
signals are converted into a binary signal using a comparator. In the sim- 
plest case the TDC then counts the number of cycles of a free-running 
clock which occur between the leading edges of the binary start and stop 
signals. 

If the times to be measured are too short to count the periods of a 
clock, ramp TDCs can be used. In these devices a capacitor is charged 
during the time between the leading edges of the binary start/stop signals. 
The capacitor can then be discharged at a slower rate and the time for this 
discharge measured with a slower clock. 

Another method for the measurement of short intervals is the Vernier 
TDC, where two slightly detuned clocks are started by the leading edges 
of the start and stop signals. As the clocks are running, they will reach a 
point where the leading edges of the clocks coincide and this can be used 
to determine the interval between them with high precision. If the first 
clock has until then counted nı cycles with a frequency fı, and the second 
m cycles with a frequency f2, then the time between the start and stop 
signals is given by 

nz—1 


fi fr 


Like in every other digitisation operation the error introduced by the 
digitisation is given by the smallest digitisation step divided by y 12, i.e. 
for a simple clock-counting TDC with a cycle period of T the average 


ram! 
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Electronically, this can be done with 
specialised ‘comparator’ integrated 
circuits. A slightly more robust com- 
parator circuit is the Schmitt trigger 
(see for example [289]). In time-critical 
applications constant fraction discrim- 
inators (see below) can be used for the 
digitisation. 


Sometimes the term ‘discriminator’ is 
used instead of comparator. 


A 


full efficiency 
for real 


rate of accepted 


“threshold 


Figure 3.15: Schematic plateau curve. 


A comparator outputs a voltage cor- 
responding to logical 1 or “on” if the 
voltage is larger than a threshold volt- 
age, or a voltage corresponding to 
logical 0 or “off” if it is below. 


At this point the device is a time- 
to-amplitude converter (TAC). The 
first TACs have been developed by 
Bruno Rossi to measure the muon life- 
time [434]. 
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RMS error is given by T / V12 (with a square distribution). Other errors, 
for example from electronic noise, will be added on top of this. 

One common issue with start and stop signals with finite rise time 
and varying amplitude is ‘time walk’. A regular comparator fires at a 
fixed threshold, and this means that signals cross the threshold at differ- 
ent times, depending on their amplitude. A clever way to overcome this, 
in particular for very fast signals as typical in nuclear and particle physics, 
is the constant fraction discriminator (CFD). Here an inverted and delayed 
copy of the signal is added to the original signal. The delay is chosen so 
that the delayed signal starts approximately at the maximum of the origi- 
nal signal. The discriminator then fires at the point of zero-crossing, which 
will always be at the same time. For signals with a length of a few ten to 
hundreds of ns the delay can conveniently be achieved by a simple cable 
of appropriate length. For fast particle detectors the delays of @(1 ns) can 
be created using a chain of gates on the readout ASIC. 


ANALOGUE-TO-DIGITAL CONVERSION 


Conversion of the magnitude of a signal (typically a voltage) to digital 
data is a common task in the processing of sensor outputs, and a wide 
range of different technologies have been developed for this (see, for ex- 
ample, [402]). These technologies differ in speed, dynamic range, linear- 
ity, noise performance, power consumption and cost. 

One of the conceptually most simple, and thus very fast, conversion 
technologies is the Flash-ADC (FADC). It is thus the technology used in 
ultra-fast sampling ADCs (up to GHz). 

In the FADC the analogue signal is compared to an array of threshold 
voltages that are created by a voltage divider chain from a reference volt- 
age. The comparison is done by comparators, the output of which is then 
encoded to digital information. 
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The strength of this technology is that the result of the comparator 
chain is instantly available and the digitisation time is only given by the 
bandwidth of the comparators and the time needed to encode the output 
of the comparators. The challenges are that the effort (number of compo- 
nents, cost, etc.) grows exponentially with the dynamic range of the ADC. 
The most serious performance limitation for flash ADCs is the linearity 
of the threshold voltages in the comparator chain due to variations of the 
resistance values in the resistor chain, and, in high speed applications, the 
propagation of the comparator outputs through the encoding logic into 
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Figure 3.16: Time walk. 
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Figure 3.17: Operating principle of a 
Constant Fraction Discriminator. The 
dark curve is in both cases the sum of 
the original signal and the inverted 
and delayed copy (grey). Note that for 
both pulseheights the time of transition 
through zero is the same. 


Figure 3.18: Schematics for a 2-bit 
FADC. MSB (LSB) stands for Most 
(Least) Significant Bit. 
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memory. If extremely high speed is not required, other ADC technologies 
have significant benefits (linearity, DNR, digisation noise etc.) [402]. 


3.9 RADIATION EFFECTS ON ELECTRONICS 


Radiation effects can induce permanent damage in bipolar and CMOS 
electronics. In addition, radiation can cause temporary effects, which are 
called single event effects (SEEs). 


RADIATION EFFECTS ON BIPOLAR ELECTRONICS 


For bipolar electronics the main effect of irradiation entails bulk damage 
in the base of a transistor (for both npn or pnp transistors). This results in 
an increase in leakage current in the same way as for any semiconductor. 
The leakage current results in a decrease of the DC current gain B of the 
transistor as the base current is increased. Smaller devices operating with 
the same collector current will be more radiation-tolerant as the leakage 
current is proportional to volume. The radiation tolerance will be greater if 
the transistors can be operated at higher collector currents because then a 
given increase in base current will reflect in a smaller change in the cur- 
rent gain B = i./iy, where i, is the collector current and i, is the base 
current. However, this results in increased power consumption. Newer 
technology has faster transistors with thinner base regions and therefore 
smaller volumes. This results in lower base leakage currents after radia- 
tion damage. This implies that there is a smaller degradation of the DC 
current gain p and therefore less sensitivity to radiation damage. However, 
bipolar devices will always show a decrease in p with radiation, but some 
decrease in p can be allowed for in the circuit designs. 


RADIATION EFFECTS ON CMOS ELECTRONICS 


As CMOS transistors rely on the transport of majority carriers, they are 
less sensitive to bulk damage of the silicon lattice. However, they are sen- 
sitive to radiation damage from ionising dose. Jonising radiation creates 
electrons and holes in the gate oxide. The holes diffuse slowly and can be 
trapped in the SiO» passivation layer present in CMOS technology. Ion- 
ising damage also creates traps at the interface of the silicon and SiO2. 

If this charge is negative, it can eventually neutralise the positive charge 
from the trapped holes but only over very long timescales. If the charge 
trapped is Qox and the capacitance is Cox then this causes a change in 
threshold voltage AV = Qox/Cox. If the change in threshold voltage is 
too large it can cause a transistor to be inoperable. 

With modern deep sub-micron CMOS technology, the thickness of 
the gate oxides has become so thin that quantum tunnelling will result in 
the trapped charges being neutralised. This has enabled very radiation- 
tolerant ASICs to be designed using commercial CMOS processes. How- 
ever, there are other oxide layers used in CMOS processes and these can 
be thicker and therefore not benefit from the release of trapped charge 
through quantum tunnelling. Radiation-induced trapped charge at oxide 
layers can cause increased leakage currents that limit the radiation toler- 
ance. Improvements can be made using “Hardness By Design’ techniques 
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We will discuss diode leakage current 
in the context of silicon detectors in 
more detail in section 9.5, and how it is 
affected by radiation in section 9.11. 


The ionising dose is described by the 
energy absorbed per unit mass and the 
ST unit is Gray (Gy, 1 Gy = 1 J/kg). 

The ionising dose needs to refer to a 
particular element or molecule. It is 
conventional to use silicon so the dose 
is specified in units of Gy(Si). The older 
unit of Rad is also still used (1 Gy = 
100 Rad). In LHC silicon detectors the 
lifetime doses are in the range from 
100 kGy(Si) to 1 MGy(Si). 

The threshold voltage in CMOS devices 
is the minimum gate-to-source voltage 
Vgs that is needed to create a conducting 
path between the source and drain 
terminals. 

‘Deep sub-micron’ refers to the feature 
size on the silicon wafer, which is typ- 
ically smaller than 0.18 um for these 
technologies. 
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to overcome these problems. One example is replacing conventional linear 
n-type MOS (NMOS) transistors with enclosed layout transistors. In the 
enclosed layout the source surrounds the gate which surrounds the drain. 
This approach has been very successful in designing radiation-tolerant 
electronics using commercial CMOS technologies but it does require more 
silicon area and it is therefore only used for the most sensitive areas [239]. 


SINGLE EVENT EFFECTS 


If too much energy is deposited in a transistor, this can flip the state of a 
digital logic cell. The energy deposited by a MIP is generally too small 
for this to occur but more heavily ionising particles such as heavy ions can 
cause this effect. 

In accelerator applications, the main cause of SEE is particles under- 
going nuclear reactions creating heavily ionising fragments. One typical 
type of SEE will result in a bit flip in a memory cell. This type of Single 
Effect Upset (SEU) might be rather benign if the cell contained transient 
hit data from a detector. However, if the cell was used as a register that 
affected the operation of an ASIC, it could result in a detector element be- 
ing inoperable until it was reset. It is impossible to avoid all SEEs and it 
is therefore essential to have effective mitigation strategies. The simplest 
one is to have periodic resets for all registers of the on-detector ASICs. A 
more effective strategy is to detect the SEEs and issue resets when errors 
are detected. 

A more fundamental approach is to use triple-event redundancy for 
critical registers. In this approach there are three instances of the register 
on the ASIC, and a ‘voting’ circuit to determine the correct value. So if 
one register is corrupted by an SEE, it will be out-voted by the other two 
copies of the register. This is a very powerful mitigation strategy but it 
obviously requires more silicon area on the ASIC, so it should only be 
used when needed. 
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Older CMOS processes were not suf- 
ficiently radiation-tolerant for LHC 
applications, but there were proprietary 
radiation-tolerant processes devel- 
oped for military applications. These 
were used for the first prototypes of 
radiation-tolerant electronics for LHC 
experiments. 


This is a major issue for electronics in 
space missions. 


It is essential that the three copies of the 
same register should be sufficiently sep- 
arated so that one nuclear interaction 
can’t deposit energy in more than one 
copy of the register. 
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Key lessons from this chapter 


e We can calculate the induced currents on a conductor caused by moving charges us- 
ing the Ramo-Shockley theorem. They are proportional to the product of velocity of 
the moving charge and the weighting field. Thus, a current is only induced while the 
charge is moving. 


e The most important fundamental noise sources in a combined detector and amplifier 
system are current noise from sources parallel to the detector and series noise from 
the amplifier. 


In a well-designed system it is the noise of the first amplifier that determines the sys- 
tem S/N performance. 


The signal-to-noise ratio can be optimised using pulse shaping. The optimal shaping 
time occurs when series and parallel noise are equal. This optimisation can be af- 
fected by other considerations, like high-rate operation. 


e The origins of the noise in BJT and FET amplifiers are distinct and this results in 
different optimisations for systems using the two technologies. 


Radiation damage in bipolar electronics results in a decrease in the DC current gain 
(B) and this limits the use at very high fluences. 


Radiation damage in modern deep sub-micron CMOS electronics is greatly reduced 
and the residual effects can be accommodated in the design. 


EXERCISES 


1. Signal and noise. 


Consider a Gaussian signal with mean u and a standard deviation o in the presence of a Gaussian 
noise source. The noise distribution has u = 0 and the same value of o as the signal. We wish to 
set a threshold value ż, such that signals x > ¢ are genuine (from the signal source) and not noise. 
We define the true efficiency to be the fraction of signal events which are above this threshold and 
the background efficiency to be the probability of a signal above threshold to arise from back- 
ground alone. Let u = 1 and consider the cases o = 1, 3 or 10. For these three cases sketch graphs 
of signal efficiency versus threshold and background efficiency versus threshold. Comment on the 
significance of these results. 


2. Laplace transform. 


Show that the Laplace transform of the function f(t) = (t/t) exp (—t/T) is given by eq. (3.14). 
If you are familiar with complex analysis, you can perform the direct calculation of the inverse 
Laplace transform of eq. (3.14). 


. Calculation of induced current using mirror charges. 
Consider a point charge of magnitude Q at a vertical distance z above an infinite grounded conduc- 
tive plane. 


a) Calculate the electric field just above the grounded plane as a function of the horizontal dis- 
tance r and hence the surface charge density. Hint: use a mirror charge. 
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b) The total induced charge on the grounded plate is —Q and clearly doesn’t change as the point 
charge moves. The grounded plane is divided into strips of width w in the x-direction (in- 
finitely long in the y-direction). One strip is centred below the point charge. Show that the 
charge induced on the central strip is given by 


= 2Q w 
q(z) = o arctan (=). 


c) Using the result from part b) show that if the point charge moved at a velocity v in the verti- 
cal direction towards the grounded plate, the induced current in the central strip is 


ws 4w 
i(t) = T oy 


4. Common source amplifier and cascode. 


This question calculates the gain of a simple common source FET amplifier and shows how it can 
be increased using the cascode circuit. 


a) Explain the small signal equivalent of a common source amplifier (compare Figure 3.6). 


b) Derive eq. (3.9) for the gain of the common source amplifier. 
c) Find an expression for the output impedance of the common source amplifier. 


d) The cascode amplifier has a common source stage with a common gate stage as the load for 
the common gate (see Figure 3.7). The input voltage is applied to the gate of the common 
source amplifier. Calculate the gain and output resistance of this cascode circuit. 


5. Optimal shaping time. 


Using eq. (3.17) determine the optimal shaping time and resulting noise. Assuming the use of a 
MOSFET amplifier operating at a current Jj = 2 mA and gm = 6 mS, determine the optimal shap- 
ing times and resulting ENC values for a detector element with a capacitance of 10 pF and with a 
leakage current of (a) 1 nA and (b) 1 uA. Comment on the significance of these results. 


6. 1/f noise from charge trapping. 


a) Consider a charge trapping/de-trapping process with a mean time between events of T. If the 
events are distributed in time according to an exponential distribution, use Fourier theory to 
determine the form of the resulting power spectrum. 


b) Now consider N such processes, with different time constants such that 74; = 107;, evaluate 
the resulting spectrum numerically for N = 2, 4 and 8 and plot a graph of the power spectrum 
and compare this with a 1/f spectrum. 


7. Noise from bias resistor. 


An often used component that contributes noise that we have ignored in our discussions in this 
chapter is a bias resistor, which is often required to connect a high voltage to the detector. 
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For small signals the DC high voltage appears as ground, and thus for the noise analysis this resis- 
tor appears in parallel with the detector capacitance. 


a) Why can we ignore the coupling capacitor Ce as a source in the noise analysis? 


b) Show that the noise spectral density due to Rp is given by 


e SS 
PP 1+ (@RpCa)? 


c) Hence, show that the equivalent noise charge due to this contribution, when integrated over 
all frequencies, is given by Q2 = kTCa. (This is the reason why this noise is often called 
‘kTC’ noise, but beware, the noise does not originate in the capacitor. Capacitors are noise- 
less!) 


d) Where in eq. (3.12) would we have to add this contribution? Hence, will it increase the paral- 
lel or the series noise component? 


4 Movement of charges and 
internal amplification 


In chapter 2 we have seen that the interaction of an incoming charged par- 
ticle with the detector material results in a number of energy transfers 
along the track that can lead to the ionisation of the detector material, cre- 
ating pairs of negative (usually electrons) and positive (positive ions or 
holes) charge carriers. In chapter 3 we have seen how the movement of 
charges creates a current signal on a system of readout electrodes of an 
electronic particle detector. To create this signal, we have first, to separate 
the charges, and then, second, to keep them moving, as the induced cur- 
rent depends on their drift velocity in the external fields. To collect these In semiconductors we can also use 
k i A R diffusion to create a signal (as used in 
charges for electrical readout, they need to be mobile, with a low probabil- some CMOS image sensors for cam- 
ity for capture and recombination. Viable detector materials are therefore €728) but the resulting signal is usually 
. : . . too slow for use in particle detectors. 
non-polar pure gases or liquids, or pure or lightly doped semiconductors. 
As the charges drift, they will collide with atoms in the material. Elas- 
tic collisions will affect their trajectory, affecting their position in a pro- 
cess that we describe macroscopically as diffusion. In addition, once the 
energy acquired by the electrons between collisions becomes sufficiently 
large, inelastic collisions will occur. One type of inelastic collision is 
ionisation of the atom with which the electron collides, resulting in an 
avalanche growth of the drifting charge cloud. This can be used to inter- 
nally amplify the signal of a particle detector, thus reducing the demands 
on external electronic amplification and noise performance. 


4.1 CHARGE DRIFT IN GASES 
DRIFT VELOCITY 


When a charged particle is moving through a gas, it will undergo colli- 
sions with the molecules in the medium. If we apply an electric field, the 
charge will be accelerated between collisions. As a consequence, the par- 
ticle will have a varying microscopic drift velocity v(t). Macroscopically, 
the movement can be described by the Langevin equation 


m— =eE+A(t), (4.1) 


where i is the macroscopic drift velocity, and A (t) is a force term describ- 
ing the interaction between the drifting particle and the gas molecules. On 


> 


average, the latter can be described by (A(t)) = —mii/t, where T is the 
average time between collisions. In the steady state this leads to 


Due to the 1 /m dependency the drift 
= eT = > velocity for electrons in gases is about 
ü= —E = HE, (4.2) 103 times larger than for ions. 
m 


where we have introduced the mobility u. 
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Ions, being of similar mass than their collision partners, will lose a 
significant fraction of their energy in collisions. Consequently, they will 
acquire only limited amounts of energy from the field, and as long as the 
fields are not becoming very large, most of their energy will be thermal, 
€ = 3kgT /2 (about 0.038 eV at room temperature). Hence we can esti- 
mate the microscopic relative speed vre; between ions and the molecules 
from m* (v2) /2 = 3kgT /2, where m* is the reduced mass of the ion/gas 
molecule system. Thus, 


eT é é 


= E~ 


«mn T ia” 
m MNO Ve} no (3kpT m*) 
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with the momentum transfer cross-section o. The macroscopic ion drift 
velocity for low fields is thus proportional to the field E. It is also propor- 
tional to n—!, which is proportional to the inverse of the pressure, p~!, 
and thus the drift velocity scales with E/p, implying that the mobility 

is constant at a given pressure. Typical ion mobilities are between | and 
10em?V~'s*. 

In gas mixtures the charge of ions with a high ionisation potential gets 
transferred in collisions to ions with lower ionisation potential, so that it 
will be these that will define the ion mobility. 

For electrons the situation is generally different. In steady state there 
is a balance between energy obtained from the field over a distance x, eEx, 
and the energy lost in N collisions over that distance, given by NeA(€), 
where € is the energy of the electron and A(é) is the average relative en- 
ergy loss per collision. Hence, 


eEx = NeA (£) = Z enle) = EE) ~ 
S x(no(e)(v))? miv?) 
-~ eEjm 2 ALE, 


where we used T = (no(€)(v))~! and € = m(v’) /2. The solutions of these 
equations for the macroscopic drift and microscopic velocities are then 


o [cm] 


10° 10° 10° 1 10 10° 10° 10" 1 10 
e [eV] e [eV] 


For electrons the momentum transfer cross-section and the fraction 
of energy lost per collision are both strongly dependent on the energy 
of the electron and the type of molecule. In gases with molecules with a 
high polarisability the cross-section shows a pronounced dip due to the 
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n is the volume number density of 
the molecules. The mean free path 
is A = 1/(no) and therefore the 
mean time between collisions is 
T=1/(NOVye1). 


E/p is also referred to as the ‘reduced 
field’. 


On slightly longer timescales ions and 
atoms will also form charged clusters. 
This will be discussed in section 7.3. 


Strictly speaking, the energy here is 
only the energy due to the electric field, 
without the thermal energy. However, 
for electron drift in gases the thermal 
energy is usually negligible. 


We approximate (v)? ~ (v). 


Figure 4.1: Collision properties far 
argon and methane as a function of 
the energy (modified from [148]). Left: 
Effective collision cross-section. Right: 
Fractional energy loss. 


The polarisability œ is defined as the 
ratio of the induced electric dipole 
moment of a molecule, and the field 
generating it, & = Pina/E. 
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Ramsauer effect. As the electron energy increases from zero, the elec- 
tron becomes more localised, resulting at first in higher polarisation of 
the molecule, which increases the magnitude of the scattering potential, 
resulting in an increase of the partial wave phase shift dy. Only at higher 
energies this eventually decreases due to short-range repulsive interac- 
tions, leading to a condition where the scattering length and thus the cross- 
section goes through a minimum [104]. Gases with high molecular polar- 
isability are the heavy noble gases (argon, krypton and xenon, but not he- Nea looking Tor non-polar, but polar- 
lium or neon) because of their large electron cloud, and spherically sym- 
metric organic molecules, like methane or neo-pentane. 
The energy dependence of the fractional energy loss reflects the addi- 
tional degrees of freedom available to three-dimensional molecules that 
turn on at low energies. 
As a consequence of these variations, drift velocities vary significantly 
for different gases and different values of E /p. Typical electron drift ve- 
locities are between 0.1 and a very few 10 cm/us. 


Figure 4.2: Electron drift velocities 
Electron drift velocity in various gases for different gases as a function of the 
reduced field [291]. 
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Similar arguments will also apply for ions at higher fields, but because 
of the smaller energies retained in the collisions without the dramatic vari- 
ations of the cross-section and the fractional energy loss. Nevertheless, at 
high fields the dependency of the ion drift velocity on the field will change 


from <E to «/E, reflecting the transition from the thermal regime to the 
field-dominated regime. 


DIFFUSION 


The collisions with the gas molecules are stochastic. Macroscopically, this 
results in slight variations of the drift velocity, which manifest themselves 
as diffusion for an ensemble of drifting charge carriers. From the diffusion 
equation a Gaussian distributed cloud of charge carriers will expand with 
time in one dimension as 


= no x? 
n(x) = Jape ( x) ; (4.3) 


where D is the diffusion constant. The width of the cloud can be described 
by its standard deviation, which increases as Oy = v 2Dt. This spreading 
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80 Figure 4.3: Ion drift velocities for 
different ions and gases as a function of 
the reduced field [148]. 
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also applies if there is a superimposed average macroscopic drift velocity, 
so that x + x — ut. 

After N = t/t collisions, each with a displacement &;, the total dis- 
placement is given by x = PL 1 6i. Hence, 


N 2N 2 We use (v2) = (v2) +02) +02) =302) 

2 2 2 2/42 2) f42 2N ed 2 e of isotropy, 2 2¢2. for 

Or = P= NED = NYE) = VE) = ZT = BV telson tne dibtion F0)— 
tT exp(—t/t). 


where (£) = m/v?) /2 is the average energy of the electrons. 
In the case that the energy of the electrons is given by the thermal limit 
(€ ) = 3kpT / 2, 
D= kgT t= kgT (4.4) This equation is known as the Nernst- 
m e ? Townsend or Einstein equation. 


and the standard deviation of the diffusion cloud is growing like 


2kgT ut 2kgTl 
e eE 


where / is the drift distance. In real gases at high reduced field E /p, 
(£) > kgT and 
Al(e) 


ah ck (4.6) 


Gases, which stay close to the thermal limit up to high values of the 
reduced field E/p (for example CO2) are called ‘cold’ gases, the opposite 
being ‘hot’ gases (for example argon). 

For a cloud of drifting electrons diffusion in the direction of the drift 
field is reduced as electrons in the leading edge of the cloud have a higher 
average speed and hence a higher collision rate. Similarly the mobility of 
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Figure 4.4: Standard deviation of diffu- 
sion after 1 cm drift in different gases 
0.2F for different electric fields at atmo- 
spheric pressure (modified from [396]). 
Full lines are from experiment, dashed 
lines are calculated. The dotted line is 
the thermal limit for room temperature 
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the trailing edge is enhanced, and consequently, while the diffusion equa- 
tion still holds in principle, the half width of the pulse in the field direction 
is characterised by a reduced diffusion coefficient [398]. 


DRIFT IN MAGNETIC FIELDS 
In a magnetic field the Langevin equation becomes 


di es e/, > 
ee +£ (ax B)- 
m 


ii. 4.7 
dt m “ 


In steady state dii/dt = 0, and the solution for the drift velocity of a drift- 
ing electron is 


e T = OT = > OT > >»\ > 
i= E+E xB (E-B) 5 , 4.8 
“ E rap | B "Be | oe) 


with the cyclotron frequency @ = eB/m,. 
In tracking detectors the E and B fields are often perpendicular, as the 


electron drift, which is providing the accurate position information, and ale me aa of drift cham- 
. . . . . . ° ers see Section /.). 
the bending of the incoming particle track are both in the same direction. 
In this case the solution reduces to (using E = (E,0,0) and B = (0,0,B)) Z 
e T B 
Ux = -7 pA E 
Me 1+ 04T y 
f T otE 
ij 
i me 1 + @?T? , 
uz =0. E 
; Bat x 
Hence, the motion of the electron is rotated around the direction of the B u 


field by an angle ¥ = atan (uy/ux) = atan(—@r). ¥ is called the ‘Lorentz E E EAE orthogonal 
angle’. The speed of the drifting electron is given by u = (e /me)TE cos ¥, i 

hence it is determined by the electric field in the drift direction. Similarly 

to the drift velocity, the Lorentz angle depends on the drift gas and the 

reduced field E /p. 
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Another important field configuration is a parallel E and B field. In 
that case the macroscopic drift velocity is not modified by the magnetic 
field. However, microscopically the trajectories of the drifting electrons 
between collisions become helices around the field direction, with a radius 
p = (v/@)sin(@), where @ is the angle between the electron trajectory 
and direction of the magnetic field. The position of an electron emerging 
from a collision at the origin with an original direction ® moving for a 
trajectory length / in the magnetic field is described by 


x(I) =p sin (= ~o) +sino , 
=p feos (= -2) -coso 


z(l) =/sin@. 


The mean square displacement in one direction can be found from the 
integration over the collision length distribution, 


(x?) = ax f 200" sin 0 dO dọ di, 
0 
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Figure 4.6: Lorentz angle in different 
gases [102]. 


Figure 4.7: Electron drift in the CDF 
outer tracker simulated with MagBoltz 
and GARFIELD (modified from [25]. 
Argon/ethane/CF 4 50/35/15 in a drift 
field of 2.5 kV/cm and for a Lorentz 
angle of 35°, wire diameter 40 um. 
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with the mean free path A, which yields 


noa 2 ā X AP 
W= = 375 wae ra 
2) 272 
= 34", 


and thus the transverse diffusion of a cloud drifting in the field direction 
is reduced, D,(B) /D,(0) = (1 + @?1t7)~!. The longitudinal diffusion is 
unaffected. 


ELECTRON ATTACHMENT 


The tendency of a molecule to capture an electron is given by the electron 
affinity, which is the energy released or required in the capture process. 
Attachment is relevant for gases with high electron affinity (e.g. halogens, 
but also oxygen or water), whereas it is absent for gases with a small or 
even negative electron affinity (e.g. noble gases, nitrogen, CO2). To not 
lose electrons in the collection process, detector gases usually are chosen 
to have low electron affinity, but gases with high electron affinity can be 
present as contaminants. 

A practical parameter to describe electron attachment is the attachment 
coefficient, which describes the probability for attachment for a collision, 
which depends on the electron energy. For electron gases with high elec- 
tron affinity this will be large at low energies and decreases for higher en- 
ergies. For negative electron affinity gases the attachment coefficient turns 
on at a threshold energy and then increases slightly with energy. 


COMPLETE TRANSPORT THEORY 


A complete solution for all drift properties can be found from the solution 
to the Boltzmann transport equation, 


of > È de mr > D = = — of 
S400, f+e (E +08) Ufa = (F) (4.9) 


where f = f (F,V,t) is the six-dimensional phase-space probability density 
function for a single particle, and J is the collision operator. J is a local 
operator in F and ź following the assumption that both the range and the 


duration of collisions are negligible. The operator J depends functionally 
on the neutral distribution and interaction cross-sections, which comprise 


elastic and inelastic (excitation and ionisation) scattering cross-sections. For a detailed discussion of the Boltz- 
EA A . oe ag f mann transport equation applied to the 
The distribution function f(F,V,t) can then be expanded in a power problem of electron drift see [407, 323]. 
series of the spatial gradient of the number density n/(7,r), 
n(7,t) =f f(#, 0,1) av. 
co 
oe kis S\k o/s 
FFI) = L SOOS (Vn), (4.10) 
k=0 


where (-V)* represents a k-fold outer product of the gradient operator 
with itself and & indicates a k-fold inner-product operation. The coeffi- 
cients in the expansion, f (k) (V), are tensors of rank k. 
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Under ‘hydrodynamic conditions’, a related equation 
dn(7,t) << 


= (k) _voky(z 
a = 20 @ (—V)*n(7,t) (4.11) 


can be used. Here, the tensor parameters o) can be associated with the 
relevant observables for the transport process: for a uniform density — ol 
is the reaction rate, the vector œl!) with the drift velocity, and the rank-2 
tensor œ?) with the diffusion coefficient. 

Inserting eq. (4.10) into the Boltzmann transport equation, eq. (4.9), 
rearranging and using eq. (4.11) yields 


0) 


k 
E (E+0xB).¥, -| FOE) = 70) - F ol (IH, 


Me 
and we can, in principle, find o and f (*) from all the previous ol!) 
and f (k1) | For this the collision operator J has to be found from the elas- 
tic and inelastic cross-sections. The calculations are quite elaborate and 
typically involve expansion of the f% using Legendre polynomials, which 
are truncated at some suitable low order. For the details see [323]. 

Today a more practical approach is based on Monte Carlo calculation, 
which tracks the microscopic path of a particle through a large number of 
collisions [251]. The path of the particle is split into path segments and 
the probability for collisions within each path segment and their outcome 
(elastic collision, excitation, ionisation, etc.) are calculated from cross- 
section data. The main challenge of the calculation is to find appropriate 
lengths for the path segments, to obtain convergence of the results. This is 
the approach used by the MagBoltz software to calculate transport proper- 
ties of electrons in a wide range of gases [137]. 

A lot more experimental data are available for transport parameters 
in a wide range of gases than direct measurements of the collision cross- 
sections. Thus, the cross-section data required for these calculations are 
reverse-calculated from the measured transport properties, either from the 
solution of the Boltzmann transport equation, or from the application of 
the Monte Carlo method. 


4.2 CHARGE DRIFT IN LIQUIDS 


Charge transport in liquids is only possible in liquids with non-polar 
molecules. The main challenge is therefore to avoid contamination with 
substances with high electron affinity, as the high density of attachment 
centres in the liquid will lead to a quick depletion of the ionisation charge. 

In a pure dielectric liquid the conductivity is low (<107!8 Q-!cm7!), 
because the energies of electrons bound in the liquid and the energy of ex- 
cess electrons is separated by about 6 to 9 eV. This gap is slightly smaller 
than in gases, because polarisation effects tend to increase the energy of 
bound electrons by 1 to 1.5 eV, whereas excess electrons move in a con- 
duction band, which is below the potential energy of a free electron by 
about 0.2 to 1 eV. 

In liquids the detectable ionisation charge is smaller than the generated 
charge, because the time needed for thermalisation is much shorter than 
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When the particle swarm evolves with- 
out memory of initial conditions and 
unaffected by boundary constraints. 


These cross-sections generally can 
depend on the scattering angle. 


MagBoltz can be accessed through 
the GARFIELD software package (see 
section 7.8). 


An open-access website with elec- 
tron and ion scattering cross-sections 
is [352]. 


For a more detailed summary of charge 
drift in liquids see [231]. 


Polar molecules have a positively 
charged end that will capture electrons. 


A conduction band will emerge when 
the mean free path becomes comparable 
to the wavelength of the electron. 
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in gases. Hence there is a significant probability for recombination of a 
newly created electron-ion pair, and the charge yield (defined as the num- e mee ae ae 
ber of electron/ion pairs created by an energy loss of the incoming particle recombination’, if charges recombine 
of 100 eV), increases with the electric field. Typical values for the charge  4fter some drift. 
yield are from 2 to 4 for argon from zero field to 10 kV/cm, and about half 
of that for dielectric organic liquids. 
Still, because of the higher density the passage of charged particles 
in liquids generates more ionisation charge per unit track length than in 
gases. Nevertheless, we would not expect this to result in larger induced 
current signals, as we would presume based on eq. (4.2) that the mobil- 
ity of the charge carries would decrease proportionally to p~!, because 
of the decrease of the mean free path. However, this is not true in liquids 
with molecules with high molecular polarisability, where the density in- 
crease weakens the long-distance polarisation attraction between the elec- 
tron and a molecule in the liquid, and thus the electron free path becomes 
larger [104]. 


10'E Figure 4.8: Electron drift velocity as 
E a function of electric field in different 
C liquids (methane at 111 K, ethane 
L at 200 K, others at 295 K, modified 
from [288]). 
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The calculation of transport properties in liquids is more complicated 
than for gases, because of the small distance between molecules and their 
highly correlated separations. Nevertheless, a multi-term approach to the 
solution of the Boltzmann equation with liquid phase cross-sections can be 
used for this calculation [160]. 
The classical liquids used in particle physics are liquid noble gases, 
and in particular argon, due to its high charge yield and electron mobility, 
and moderate cost. Purification of cryogenic liquids is a by-product of the 
liquefaction process, but the obvious drawback of these liquids are the 
low temperatures required for liquefaction, with the associated need for Liquefaction temperatures are —186 °C 
i 7 é i for argon, —153 °C for krypton, —108 
thermal insulation and substantial cryogenic plants. °C for xenon (at atmospheric pressure). 


Examples for organic liquids with high electron mobilities used in par- 
ticle detectors at room temperature are tetramethylsilane (TMS) or 2,2,4,4- 
Tetramethylpentane (TMP). The main challenge for these liquids is the 
purification, but the viability of the technology for large scale experiments 
has been demonstrated for the KASCADE experiment [409]. 
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4.3 CHARGE DRIFT IN SEMICONDUCTORS 


In solids electron states occur in energy bands. Typically most outer elec- 

trons reside in a filled valence band. These electrons cannot drift under the 
influence of an external electric field because of the Fermi exclusion prin- 

ciple. To move, they have to be in the higher-energy conduction band. We 

distinguish three cases: 


E 
conduction band 
E >5 eV ._slectrons „| y 
-E E a eee a a 
holes y 
valence band 
T=0K T>OK 
metal insulator semiconductor 


e In metals, the conduction and the valence band overlap. The number 


of mobile electrons far exceeds the number of electrons generated 
by ionisation, and the ionisation signal is unobservable. 


e In semiconductors, there is a finite, but small band gap (a few eV). 
Some electrons are lifted to the conduction band thermally, but this 
population can be kept small by lowering the temperature or reduc- 
ing charge carrier concentrations by doping and biasing. Electrons 
can then be raised from the valence to the conduction band by the 
interaction with the incoming particle. This then leads to an ‘elec- 
tron’ in the conduction band and a ‘hole’ in the valence band. The 
hole behaves effectively like a mobile positive charge. The resulting 
electrons and holes can be seen as moving to the readout electrodes 
under the influence of an electric field. 
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e In insulators the band gap is larger (>5 eV). Insulators can be used 
as particle detectors (for example diamond with a band gap of 
5.5 eV), but only if the material is very pure. Normal insulators have 
too many interstitial states, which will trap the electrons and result 


in local de-excitation, strongly limiting the range of the electrons, so 


that a useful signal cannot be generated. 
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Figure 4.9: Band structure in solids. 


See section 9.1. 


Both ‘electrons’ and ‘holes’ in the 
semiconductor are intuitively sim- 

ple concepts that are widely used, but 
should not be confused with free nega- 
tive or positive charges. They describe a 
quantum mechanical state of the lattice 
with its electrons as a whole, similar 

to electrons in an atom that cannot be 
described individually, but are part of 
the wavefunction of the whole atom. 
Microscopic classical pictures of the 
transport process are of limited validity. 


Figure 4.10: Energy used per electron- 
hole pair vs band gap in different 
materials [395]. The difference between 
the energy used per electron-hole 

pair and the band gap is due to lattice 
excitations (phonons) to conserve 
4-momentum, and damage to the 
lattice [309]. The dotted line is the 
line where the energy to create an 
electron-hole pair equals the band gap. 
Other lines are the result of a linear 
fit to two groups of data following 

the parameterisation given in the 
figure and explained in the references. 
In silicon, the band gap is 1.12 eV, but 
the average energy used per electron- 
hole pair is 3.6 eV. 
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The drift of charge carriers in semiconductors is, like in gases or 
liquids, governed by collisions in the material. In semiconductors two 
sources of collisions are relevant: Scattering on the lattice, i.e. interac- 
tions with phonons, which are acoustic at low energies and optical at high 
energies, and scattering on ionised impurities. 

It can be shown that the mobility for scattering on acoustic phonons 
Ly is proportional to T~3/?(m*)—>/?, where m* is the conductivity ef- 
fective mass [113]. However, at larger field strengths the drift velocity 
saturates. In materials without accessible higher bands (for example sili- 
con), the cause for the saturation is inelastic scattering of the charge car- 
riers with the emission of optical phonons. If the energy of the phonons 
is given by Ephonon, then the speed, at which this occurs will be given by 
v = (2Etongn/ Mz) 1/2. In silicon, the conductivity effective masses of elec- 
trons and holes are similar (0.26 me and 0.39 me, respectively), and the 
ratio of saturation velocities is close to 1 (vs*"/ v ~ 5/4). In GaAs the 
electron conductivity effective mass (0.067 me) is much less than the hole 
effective mass (0.34 me), resulting in a much higher saturation velocity for 
electrons at moderate electric fields. At higher fields the mobility in GaAs 
actually goes down, due to the existence of two local valleys in the band 
structure, one being a high-mobility region, and the other a low mobility 
region at higher energy. At high fields the charge carriers can be lifted to 
the low-mobility valley, resulting in a decrease of the drift velocity. 
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The contribution to the mobility from impurity scattering is 
Li x 73/2 rn), where n; is the impurity density [211]. For typ- 

ical doping concentrations in the active areas of silicon detectors 

(ni < @(10!”) cm~3), the electron and hole mobilities do not vary sig- 
nificantly with impurity concentration [70]. 

Thermal movement of the lattice manifests itself in oscillations of the 
scattering centres around their equilibrium positions, and thus the mobility 
for scattering on the lattice for both charge carrier types depends on the 
temperature. 

A practical parameterisation of the temperature dependence is 


(7) uk), 


where Tọ is a reference temperature, usually 300 K, and 0 is 2.4-2.5 for 
electrons and @ ~ 2.2 for holes in silicon. 


u} (T) = 
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Acoustic phonons are coherent move- 
ments of atoms of the lattice out of 
their equilibrium positions, resulting 

in denser and less dense areas if the 
displacement is in the direction of prop- 
agation, like in a sound wave. Optical 
phonons are opposite displacements of 
adjacent atoms in the lattice. They are 
called optical because in ionic crystals 
such fluctuations in displacement create 
an electrical polarisation that couples 
to the electromagnetic field. 


This is called the ‘transferred electron’ 
effect. 


Figure 4.11: Drift velocity dependence 
on the electric field in high-purity 
silicon and GaAs at 300 K (modified 
from [468]). 


The combined mobility due to impurity 
and lattice scattering can be found from 


1/u =1/4i + 1/4. 
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A widely used parameterisation for the mobility as a function of the 
electric field is [181] 


—1/B(T) 


gNBO) 
uE) =mi + ($) ; 


where Up is the low-field mobility, Ee an empirical fit parameter, and B(T) 
is a temperature dependent parameter, which has a value of order unity. 
Diffusion again increases like the square root of the drift distance, and the 
diffusion coefficient can be found from the Nernst-Townsend equation, 
eq. (4.4). 

In the presence of a magnetic field, the drift is described similarly to 
eq. (4.8) in gases, 


u 


i = —— 
1+ uB? 


[E + un x B+ uj (E-B) B], 

but with the Hall mobility Uy = ryu, with the Hall scattering factor ry, 
which is about 1.15 for electrons and 0.7 for holes in silicon [468]. Be- 
cause of the similar expression for the drift velocity the direction of the 
charge carriers in orthogonal electric and magnetic fields is again rotated 
by the Lorentz angle from the direction of the electric field. 


4.4 INTERNAL AMPLIFICATION 


Often the ionisation charge produced in the detector material is too small 
to be sensed amongst the irreducible electronic noise that is present in an 
electronic readout system. This is particularly true for gaseous detectors, 
where the primary ionisation can be as low as a few ten to a hundred elec- 
tron/ion pairs, but even in semiconductor detectors, where the ionisation 
from a minimum-ionising particle can be a few thousand electron/hole 
pairs, a primary signal increase can be beneficial for small signals (for ex- 
ample in response to low intensity visible light). 


Figure 4.12: Mobility of electrons 
and holes in silicon as a function of 


temperature [468]. 


For electrons in Si at T = 300 K, 
B = 1.3, Uo = 1450 VAcm’s), and 


E; = 7240 V/cm. 
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In these cases internal charge multiplication as part of the creation of 
ionisation signal in the detector is required, which amplifies the charge 
generated in the detector before the signal is collected on the electrodes. 
This is achieved by impact ionisation in high electric fields in the detector. 
The primary electrons get accelerated in the high electric fields in between 
collisions with atoms inside the detector, so that they get sufficient en- 
ergy to ionise another atom in the next collision. The same will happen for 
holes, but at higher electric fields, due to their lower mobility. The differ- 
ence between the ionisation rates for electrons and holes is very important 
for Avalanche Photo Diodes (APDs) as discussed in section 9.9. 


PHOTOMULTIPLIER TUBES 


A particle detector that exploits impact ionisation on a macroscopic scale 
is the photomultiplier tube (PMT). 
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A PMT is a device to detect photons in the visible and near-UV range. 
The sensitive element in the PMT is a photocathode, in which the incom- 
ing photons create photoelectrons. A critical figure of merit for the photo- 
cathode is the quantum efficiency (QE), which is defined as the probabil- 
ity of an incident photon resulting in an emitted photoelectron. There are 
three stages which affect the QE [316]: 


1. A photon is absorbed in the cathode by an atom with the emission 
of an electron (‘photoelectron’); 


2. The photoelectron moves to the surface of the cathode; 
3. The photoelectron escapes from the cathode into the vacuum. 


The efficiency for the first stage depends on the reflection at the cathode 
surface and the probability of photon absorption within the finite thick- 
ness of the cathode. In the second stage the photoelectron will lose energy 
Eoss- In order for an electron reaching the edge of the cathode to escape, it 
must have a kinetic energy (T) greater than the barrier height (e®,) at the 
photocathode surface. 

For a metal, there are electrons in the conduction band. The barrier 
height is given by e4 = Eyac — Ep, where Eyac is the energy of an electron 
in the vacuum and FF is the Fermi energy. In a semiconductor, the photon 
absorption comes from interactions with electrons in the valence band. 
Therefore the barrier height is ep4 = Eg + e%s, where Eg is the band- 
gap energy and e7, is the energy difference between the bottom of the 
conduction band and the vacuum [316]. The photoelectron initially has an 
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We frequently use the term charge col- 
lection, but as we have seen in chapter 3 
the current signal is induced during the 
motion of the charged particles until 
they arrive on the electrodes. 


For a detailed mathematical description 
of avalanches in thin silicon layers 
see [500]. 


For a comprehensive discussion of 
PMTs see [405]. 


Figure 4.13: Schematics of a box and 
grid type photomultiplier tube. 


This happens by electron-electron and 
electron-phonon interactions. 


Note this is different to the case of 
single isolated atoms. Here we are 
considering electrons in a solid. ®4 is 
called the electron affinity. Semiconduc- 
tors can be engineered to have negative 
electron affinity and therefore lower 
barrier heights and therefore better QE 
at longer wavelengths [316]. 
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energy T = hv, where v is the frequency of the photon. Therefore for the 
photoelectron to escape hv — Ejoss > eP 4. 

The QE for metals is too low to make useful photocathodes. High 
quantum efficiency photocathodes are therefore usually made using com- 
pound semiconductors. These will have much lower reflection than metals, 
and because the electron density in the conduction band is very low, there 
will be very little electron-electron scattering. Semiconductors with high 
QE are usually made from a combination of antimony (Sb) and one or 
more alkali metals, e.g. (NaK)3Sb. These semiconductors have small band 
gaps and very high absorption coefficients for UV photons. They also have 
small values of the electron affinity. The QE can be improved by appropri- 
ate doping to change the Fermi energy level. In addition a very thin layer 
of a few atoms of Cs can lower the energy of the top of the conduction 
band to be below the vacuum energy level (‘negative electron affinity’). 
Photoelectrons are emitted by quantum tunnelling through a very thin sur- 
face barrier [316]. 
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The photocathode only needs to be a very thin layer, and is usually 
created by vacuum deposition on the rear of the photon entrance window. 

In the PMT the photoelectrons emerging from the photocathode are 
guided by electric fields onto a dynode. The field is designed to maximise 
the fraction of collected electrons, and to accelerate the electrons to suf- 
ficient energy, so that the impact of the electron on the dynode liberates 
additional electrons from the surface. This process continues then on a 
set of typically 10 to 14 additional dynodes that are held at progressively 
lower potential by a voltage divider network. The electric field between 
the dynodes accelerates the electrons, so that more electrons are ejected 
in the impact on the following dynode. A modest ‘gain’ in the number of 
electrons at each dynode can result in a very large overall gain by using 
multiple dynodes. To allow for the travel of the electrons and to protect the 
photocathode and dynode surfaces the photomultiplier is contained in an 
evacuated glass tube. 

Photomultipliers are cheap, yet efficient and fast photon detectors with 
a large acceptance. They are therefore ideal for the detection of Cherenkov 
photons in large neutrino experiments, or for the optical readout of large 
calorimeters. The high speed makes PMTs very capable timing detec- 
tors with good time resolution (small ‘time jitter’), typically at the level 
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Eoss is the energy the photoelectron 
loses before it reaches the surface. 


Metals have a very high reflectivity for 
the incoming photons. Also the large 
density of free electrons results in very 
rapid energy losses of the photoelectron 
by electron-electron scattering. There- 
fore the probability of a photoelectron 
reaching the surface with sufficient 
energy to escape is very low. 


Photocathodes using two alkali ele- 
ments are called ‘bi-alkali’ photocath- 
odes. These can be selected to achieve 
higher QE at longer wavelengths than 
‘mono-alkali’ photocathodes. For good 
QE for visible photons multi-alkali 
photocathodes are used. 


Figure 4.14: Quantum efficiency for 
different photocathode materials as a 
function of wavelength [317]. Bialkali 
photocathodes have a good sensitivity to 
the wavelengths of most commonly used 
scintillators. Multialkali photocathodes 
can be used if sensitivity at longer 
wavelengths is required. GaAsP and 
GaAs are more expensive and are only 
used for very small area photocathodes. 


The typical mean free path in the pho- 
tocathode for a photon with 1 to 3 eV is 
less than I um (see section 2.1). 


A dynode is an electrode that is opti- 
mised for secondary electron emission. 
Unlike the photocathode the dynodes 
do see significant currents, and thus 
typically consist of an alkali on a good 
conductor (metal). 


Typically 3-5 secondary electrons are 
ejected by an impinging electron with 
an energy of 100 to 200 eV. 


Typical total gains can be as large as 


6 (10°). 
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of ns. The time resolution is dominated by the variations of the transit time 
for photo- and secondary electrons emitted in different directions. The 
main drawbacks of PMTs are their large size and hence limited position 
resolution, and that they are strongly affected by magnetic fields. They 
are therefore often enclosed in a layer of ‘u metal’, a soft, ferromagnetic 
nickel-iron alloy. 

The dominant source of noise in a PMT is thermionic emission. If this 
happens in the photocathode or the first dynodes, the emission will re- 
sult in an output signal that cannot easily be distinguished from a photon 
signal (this is referred to as “dark current’). If purity of the signal is re- 
quired noise signals need to be rejected by coincidence techniques (see 
section 13.2). 

A large number of PMT models with different electrode configurations 
are available, providing a wide range of sizes, acceptances, timing perfor- 
mances and spectral sensitivities. 


MICROCHANNEL PLATES 


A microchannel plate (MCP) detector consists of one or more plates made 
of resistive material (typically glass), perforated with small holes (typi- 
cally 5 to 20 um diameter). 


Photocathode 
THUY wer 
PEELS tee 


Anode 


The inside of the holes is coated with a semiconductor material, and 
the faces of the plate are metalised. A voltage is applied between the sur- 
faces of the plate, which accelerates electrons in the holes and impact ion- 
isation takes place on the inner hole walls. To increase gain, the holes are 
not exactly perpendicular to the faces of the plate (typically they are in- 
clined by 10°). Consecutive plates have opposite inclinations. 
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Another contribution to the resolution 
of the time measured when the signal 
crosses a threshold is from pulseheight 
variations due to the stochastic amplifi- 
cation process. This can be overcome by 
the use of constant fraction discrimina- 
tors (see section 3.8). 


u metal shielding works very effectively 
for low magnetic fields but there is a 
saturation effect so they do not pro- 
vide sufficient shielding for the large 
magnetic fields often used in high en- 
ergy physics experiments. Therefore 
alternative detectors such as avalanche 
photodiodes (see section 4.6) are used 
for readout of scintillators in regions of 
high magnetic field. 


Figure 4.15: Different PMT geome- 
tries [230]. (Courtesy of Hamamatsu 
Photonics K.K.) The diameter of the 
largest tube in the picture is about 
50 cm. 


Figure 4.16: Schematics of a chevron 
MCP. 


An MCP is a continuous-dynode elec- 
tron multiplier. 


For two plates this is called a ‘chevron’ 
MCP, for three a ‘Z stack’ MCP. 


Movement of charges and internal amplification 


Because of the localisation of the amplifying holes MCPs have a high 
spatial resolution and are often used in image intensifiers in night-vision 
devices. The small size of the amplification channels results in very good 
time resolution. In experimental physics they are therefore the ideal de- 
tectors for time-of-flight mass spectrometry, and they are used in particle 
detectors using precision time-of-flight measurements for particle identifi- 
cation, e.g. TORCH [399]. 


4.5 AVALANCHE MULTIPLICATION IN GASES 


Due to the low density, the amount of primary ionisation in gaseous de- 
tectors is small (typically a few 10 electron-ion pairs/cm). Internal am- 
plification is therefore a must. This is achieved by very strong electrical 
fields (>100 kV/cm), so that an electron drifting through the gas acquires 
enough energy between collisions with the gas molecules that it will ionise 
the next molecule it collides with. 

Once this condition is met, the number of charge carriers will grow 
exponentially into an avalanche of electrons. At this point we assume that 
the increase of electrons in the avalanche is proportional to the number 
of electrons already present, d(N-) = a(N-) dx, where the proportional- 
ity constant & is called the ‘(first) Townsend coefficient’. Predominantly, 
every collision will create one additional electron-ion pair, and thus 1/a 
corresponds to the mean free path between collisions. As the Townsend 
coefficient depends on the field and the mean free path between collisions, 
it scales with the reduced field E/p, where E is the field strength, and p 
the pressure of the gas, once the field strength exceeds the threshold for 
avalanche creation. 

In general, we also have to allow for a competing mechanism, electron 
attachment. The effective increase in the number of electrons is 


d(Ne) = (Ne) (@— n) dx, 
with an attachment coefficient n. This equation can be integrated to get 


(Ne(x)) = Noe, (4.12) 
a* = a—17 is called the effective ionisation coefficient. 

A = (Ne ginal) /No is called the ‘gas gain’. In general, both the 
Townsend and the attachment coefficient do depend on the local field 
strength. This will vary with location for general field geometries, and in 
addition the local field strength will be modified by the space charge cre- 
ated in the avalanche itself, if the number of electrons in the avalanche is 
large. The exact avalanche properties can thus in general not be calculated 
analytically, but they can be obtained using Monte Carlo techniques (see 
for example [338]). 

Due to the absence of rotational and vibrational modes, ionising col- 
lisions in noble gases have no competition, and thus avalanche multipli- 
cation in noble gases starts at lower field strengths than for more complex 
molecules. This and the negative electron affinity makes noble gases at- 
tractive detector gases. Argon provides a larger number of primary elec- 
trons than helium and neon, and is more affordable than krypton or xenon, 
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See section 12.4. 


The second Townsend coefficient gives 
the number of electrons, which can be 
released on average from a surface by 
an incoming ion. 


The mean free path is proportional to 
the molecular density n, which in a gas 
is given by n ~ p/(kpT). 


In principle, this should also include 
recombinations, although in practice 
these are less relevant, because ion 
densities are usually still low, and the 
electric fields tend to separate electrons 
and ions in the avalanche. 


Similarly, for the number of positive 
ions d(N;,) = (Ne) & dx, and 


(Neon ()) = aa (esn 1) 


The number of negative ions can be 
found from 


(Nion (*)) = (Ne(X)) — (Nion (2))- 
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and thus is chosen as the main gas component for many standard gaseous 
detectors. 

In mixtures of noble gases and gases with low ionisation potentials (in 
gaseous detector typically the quencher, see below) the gain is increased 
by Penning transfers, where excited states of the noble gas de-excite and 
the energy is used to ionise the other gas, 


A*+B5A+BT +e. 


The increase in gain can be described by a modification of the Townsend 


coefficient 
OPpenning = & (1 + FPenning ) , 
fion 


where @ is the Townsend coefficient without transfer, i.e. only due to the 
collision cross-sections, rpenning the Penning transfer rate, which is the 
probability that an excited noble gas atom A ionises a molecule B, fexc 
is the sum of the production rates for the excited states of the noble gas 
A that have a larger excitation energy than the ionisation threshold of the 
molecule B, and fion the sum of the production rates of the direct ionisa- 
tions [438]. Similar arguments apply for ternary mixtures [437]. 

Our use of averages reflects the fact that the avalanches are created 
by stochastic processes, and thus, for the same initial charge, their size 
will vary. In the absence of attachment and space charge effects, and for 
moderate fields, the ionisation coefficient can be assumed to be constant. 
In that case it can be shown that the probability density for an avalanche 
of size Ne after a distance d, starting from a single electron, will fall 
exponentially, 


-l eNe(d)/(Ne(d)) 
P(Ne|d) (N, (d) e f 
Including attachment does not change the shape of the distribution (for 
sufficiently large (N.)), but there is a finite probability that the avalanche 
fizzles out (NV. = 0), if the few electrons at the start of the avalanche are all 
getting attached. The exponential distribution of the remaining avalanches 
adjusts to still satisfy eq. (4.12) [328]. 

If there is a number of m initial electrons the total amount of charge af- 
ter amplification is the result of the superposition of individual avalanches, 
if the gas gain is sufficiently low that the space charge created in the am- 
plification does not distort the local field. For large m the avalanche size 
distribution will approach a Gaussian, in accordance with the central limit 
theorem. 

However, the assumption of a constant ionisation coefficient is not 
correct, because the ionisation cross-section depends on the energy of the 
electron. As the electron typically emerges from the previous collision at 
a low energy, and only gradually increases its energy in the accelerating 
field, the probability for an ionising collision will vary along its path. In 
addition, there is competition from other inelastic processes (excitation), 
which will dominate at lower electron energies. The energy spectrum of 
electrons in the avalanche shifts to higher energies for higher values of the 
reduced field E /p, and thus the probability for ionisation increases. 


(4.13) 
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Helium is used in gaseous detectors if 
low material (multiple scattering) is 

a design driver. Xenon is used when 
detection of X-ray photons is required, 
for example in transition radiation 
detectors (see section 12.5). 


For example, in Ar/COy mixtures ex- 
cited states of Ar from 3p°3d and 
higher can ionise CO2. 


See problem 3. 


The standard deviation of the avalanche 
size in this case is equal to the mean, 
(Ne(d)). 
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Several attempts have been made to model this behaviour, most of 
them parameterising the ionisation coefficient as a function of the distance 
since the last collision. The results of these models are typically described 
as Pólya distributions, 


(0 y 1)?" 


P(Neld, 8) = yr +1) 


( Ne(d) ) ê (0n tela 
(Ne(d)) ' 
for large Ne. The parameter 0 is usually determined from a fit to the data. 
The agreement with data is satisfactory, but the exact physical interpre- 
tation of this parameter is under debate [212, 43]. For 0 = 0 eq. (4.14) 
reverts to eq. (4.13), for 8 — œ it becomes a Poisson distribution. 

Again, Monte Carlo simulation is the most powerful tool to deal with 
the complexities of the physics [447]. 
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In the avalanche ionisation is again not the only possible outcome of 
the collisions. Sometimes, a hit atom will end up in an excited state, and 
subsequently de-excite with the emission of a photon. Gaseous detectors 
are typically optically transparent, and this photon can travel macroscopic 
distances. At some point, it will interact with the gas or the detector walls, 
releasing a photoelectron, which in turn will become the start of a new 
avalanche. When this process repeats, it leads to a continuous discharge in 
the detector, rendering the detector useless. 


(4.14) 


87 


Figure 4.17: Energy distribution (ar- 
bitrary scale) of avalanche electrons 

in inelastic collisions, and inelastic 
cross-sections. Energy distributions are 
computed using Monte Carlo simulation 
for different homogeneous fields in 
methane (modified from [447]]). 


For an overview see [43]. 


Figure 4.18: Avalanche size distri- 
bution for methane [212]. Circles: 
measured for E / p = 156 Vem~'Torr“|, 
E/(QWyon) = 5.3. Squares: Pólya dis- 
tribution for 0 = 1 (arbitrary ordinate 
units). 


This is called ‘photon feedback’. 


88 
There are two ways around this issue: 


e The high voltage for the amplification field is supplied via a resis- 
tor. When the detector goes into discharge, the increase in current 
will increase the voltage drop across this resistor, so that the field 
in the detector drops to levels below what is required for avalanche 
multiplication. This is the principle of the Geiger-Miiller counter. 


The drawback of this approach is that the removal of the charge 
through the large resistor takes a long time (>us), during which 
the detector is blind to other particles (‘dead time’). 


e A polyatomic gas like CH4 or i-C4H jo is added that has a large ab- 
sorption coefficient for photons in the few-eV range, by excitation 
of rotational and vibrational levels. Ultimately, the energy is then 
safely absorbed in relaxation, dissociation or elastic collisions of 
these molecules. Such an additive is called a ‘quencher gas’. A 
secondary function of a good quencher is that its atoms exchange 
charge with ions of the main gas so that these ions cannot strike the 
cathode surface, where they can release secondary electrons. For 
this its ionisation potential needs to be lower than for the main gas. 
The ionised quencher molecule does not emit a secondary electron 
at the cathode, but instead neutralises and de-excites or dissociates. 


Equation (4.12) implies the proportionality of the signal charge with 
the primary charge generated in the gas volume by the incoming particle. 
Typical gas gains for proportional avalanche multiplication are 10 to 10°. 
However, at higher fields so many secondary electron-ion pairs can be cre- 
ated in the avalanche that their collective space charge modifies the local 
electric field. The field is increased in the regions of large charge densities 
at the front and the tail of the avalanche, and reduced in between. At this 
point the proportionality starts to break down. At even higher fields and 
thus gas gains space charge effects together with photons from recombina- 
tions in the region of reduced field can create macroscopic discharge chan- 
nels or ‘streamers’, for which the signal becomes completely independent 
of the amount of primary charge. This is called a ‘limited streamer mode’. 
In this mode the typical gas gains are between 10’ and 10°. 

Finally, at extremely high fields the detector will go into plasma dis- 
charge, at which operational stability can only be restored by external re- 
duction of the voltage (Geiger-Miiller mode). Empirically, the discharge 
happens once the avalanche from a single electron has reached exp(ax) 
particles with ax ~ 20, or about 5 x 108 particles in the avalanche. 
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Figure 4.19: Discharge-limiting resistor 
in a Geiger-Miiller counter. 


i-C4H\o is the empirical formula for 
iso-butane, (CH3)2CHCHs3. 


A historic approach to limiting the 
discharge in a gaseous detector by elec- 
tronic means were spark or streamer 
chambers. Here, the high voltage was 
turned on inside the detector for a short 
time in response to an external trigger 
signal that indicated the passage of 

a particle through the chamber. The 
detector created visible (and audible) 
sparks, which were photographed. 

This type of detector was used from the 
1930s to the 1960s (for more informa- 
tion see [428]). Today it is sometimes 
used as a demonstration device. 


This is called the ‘limited proportional- 
ity’ regime. 

Typically this happens once the space 
charge-induced electrical field becomes 
of similar strength as the external field. 


In geometries with strongly varying 
external field (for example around a 
wire) the growth of the streamer will be 
stopped once the external field becomes 
too small to sustain the streamer. 


This is known as the Raether 
limit [416]. 
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relative electric field 


Figure 4.20: Schematics of avalanche development in gases for small avalanches and proportional mode (left), medium avalanches and limited 
proportionality (middle), and large avalanches and streamer formation (right). x is in the direction of the field. Eo is the field in the absence of the 
avalanche space charge. 


4.6 AVALANCHE MULTIPLICATION IN SEMICONDUCTORS 


Unlike in gases, impact ionisation in semiconductors can occur for both 
types of charge carriers, electrons and holes. If the mean free path for a 
charge carrier is À, then a charge carrier drifting in an electric field of 
strength E will on average acquire an energy AE. If this energy is suffi- 
ciently large, the carrier can cause further ionisation, hence we have the 
possibility of charge multiplication in a similar way as in gases. Generally, 
multiplication factors for electrons and holes are not the same and it does 
depend on the type of semiconductor which one is larger at a given field 
strength. For silicon, the multiplication factor for electrons is much larger 


than for holes. The difference between impact ioni- 
5 š r : PON 3 ` sation rates for electrons and holes is 
We will look at practical application of avalanche multiplication in important for low noise semiconductor 
semiconductors in chapter 9, once we have discussed the principles of avalanche detectors: 


doping of semiconductors. 


106 106 Figure 4.21: Ionisation rates at 300 K 
300 K ` 5 3 ‘ 
—— Electrons a, versus reciprocal electric field for 
---- Holes @, N Si, GaAs, and some IV-IV and III-V 
compound semiconductors [468]. 
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Key lessons from this chapter 


e The movement of charge carriers is caused by electric fields and governed by colli- 
sions with the detector material. 


In principle the drift velocity is proportional to the electric field, with the mobility as 
the proportionality factor. In practice the mobility is not a constant, but depends on 
the physics of the collisions. 


In gases electrons can acquire significant energy between collisions. The large range 
in energy, together with strong variations of the collision cross-section (Ramsauer 
effect) and the energy loss fraction leads to non-trivial behaviour of the drift velocity. 
For ions, the drift velocity is simpler, but much slower (by about three orders of mag- 
nitude). 


The size of a cloud of electrons is increased by diffusion. It generally grows with the 
square root of the drift distance. In gases, the diffusion coefficient is at the thermal 
limit in ‘cold’ gases like COz and larger in ‘hot’ gases like argon. 


In liquids high electron drift velocities can be obtained for materials with molecules 
with a high polarisability (typically these have a high degree of spherical symmetry). 
Typical examples are larger noble gases and some organic liquids. The challenge is 
the required purity. 


Typical solid particle detectors are made of semiconductors, because of the availabil- 
ity of pure material, and the band gap resulting in only moderate amounts of charge 
carriers in the conduction band, which can be removed by appropriate means. In 
solids, electrons and holes can contribute to the net movement of charge. 


Magnetic fields affect the drift of charge carriers. If the magnetic field is orthogonal 
to the electric field, the drift direction gets rotated by the Lorentz angle. In parallel 
fields, the transverse diffusion is reduced. 


If the energy acquired between collisions is large enough, secondary ionisation oc- 
curs, leading to an avalanche growth of the charge in the detector. 


EXERCISES 


1. 


3. 


Transport properties in gases. 


Estimate the mean free path for electrons in a gas at STP. (You might find useful that the molar 
volume for an ideal gas is 22.4 x 107°? m?/mol and the diameter of atoms is of the order 10~!° m). 


Assuming that the thermal component of the electron velocity dominates, what is the mean veloc- 
ity of the electrons and the time between two collisions? 


. Drift in magnetic fields. 


Show that eq. (4.8) is a solution to the Langevin equation, eq. (4.7). 


Avalanche size distribution. 


In this problem you will derive the avalanche size distribution P(n|d), that the avalanche comprises 
n electrons after a distance d (eq. (4.13)). 
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Assume that the Townsend coefficient varies with the location, œ = a(x). (It is sufficient to think 
about this in 1D.) The starting point for the avalanche is one electron at x = 0, i.e. P(1|0) = 1. 


a) First, find the probability P(1|x) that a charge has not created a secondary ionisation at all 
between x = 0 and x =x. 


b) If the probability that the avalanche contains n — 1 electrons at x =.’ is P(n—1,x’), show that 
for small Ax’ the probability that one and only one of these electrons will ionise between x’ 
and x’ + Ax’, so that there are n electrons, can be written as (n— 1)a(x’)Ay’. 


c) Show that the probability that none of these n electrons will ionise in the region between x + 
Ax’ and x is exp[—n({j a(x") dx” — fo a(x”) dx”)]. 


d) Find an equation for P(n|x) by multiplying these expressions and integrating over x’. Using 


Ne= 2 nP(x|d) = elo a(x) ax 


n=1 
reproduce eq. (4.13). 


4. Efficiency and dark current of a PMT. 


The typical quantum efficiency for a PMT with semitransparent bi-alkali photocathode is close to 
25% in the spectral range between 350 and 400 nm. 


a) If 10 photons impinge on the photocathode, what is the probability that no photoelectron is 
produced? 


b) If the PMT is coupled to a slab of plastic scintillator (9 = 1.05 g/cm? and a light yield of 10 
photons/keV), and the probability of a scintillation photon from the scintillator to reach the 
photocathode is 10% (due to collection and coupling efficiencies), how thick does the slab 
have to be to achieve 95% detection efficiency for minimum ionising particles? 


Cc 


wm 


Estimate the transit time (the time from the creation of the photoelectron to the arrival of 
the signal at the anode), if the PMT has 10 stages with a separation of 10 mm, and the inter- 
dynode voltage is 200 V. Assume that at each dynode the electrons are created at rest. 


d) The PMT has a gain of 10° and displays a dark current of 1 nA. What is the rate of thermal 
electron emission from the photocathode? Why is this an issue, and what can you do to get 


this under control? 


w~ 


5. Gain stability of a PMT: 


The gain of a dynode in a 10-stage PMT as a function of the inter-dynode voltage Vayn can be pa- 
rameterised as Ggyn % vis. If the overall voltage with which the PMT is operated is 1 kV, what is 
the acceptable operating voltage fluctuation, if the overall gain of the PMT is to stay within 1%? 


5 Response to excitation 


So far, we have discussed signals from the collection of ionisation charge 
produced by an incoming charged particle. As long as the volume is large 
enough to provide a sufficient amount of charge, the detection elements in 
such detectors can be made small, leading to detectors that are capable of 
precisely locating the passage of charged particles. However, sometimes 
position resolution is not the primary target, but a large dense detector vol- 
ume or cheap instrumentation of a large volume is required. In that case 

it is often easier to collect optical photons, which have a long range in 
transparent materials. Such photons can be generated in detector materi- 


als called ‘scintillators’ from the relaxation of excited states created by For an early, but comprehensive intro- 
es k i i i i duction to scintillation see [145]. 
the electromagnetic interaction of the incoming charged particle with the 
detector. The other source of optical photons 
: i fate in particle detectors is Cherenkov 
To achieve this long range, the energy of the excitations needs to be radiation, as discussed in section 2:8 


degraded, so that the energy of the photon is insufficient to cause further a ii Cherenkov and 
excitations in the detector material, resulting in the loss of the photon. 

Depending on the nature of the material, we distinguish between or- 
ganic and inorganic scintillators. 


5.1 ORGANIC SCINTILLATORS 


In organic scintillators it is the de-excitation of molecular electrons that 
generates these photons. Organic substances that contain aromatic rings, 
such as polystyrene (PS) and polyvinyltoluene (PVT), do display this 
effect. Because scintillation in organic scintillators is happening within 
molecules, the actual state of the material is not that relevant, and there are 
plastic, crystalline, liquid and gaseous organic scintillators. 

The molecules typically have a spectrum of electronic singlet and 
triplet states, with a typical separation at the level of eV, plus a fine struc- 
ture of vibrational levels with a ten times smaller spacing. 


* Singlets Triplets Figure 5.1: Schematic operating prin- 


ciple of an organic scintillator. Left: 
Potential energy as a function of atomic 


degradation ; separation. Right: Energy levels. 

SS S ES o T“ 
g ie ——— 
5 emission H j Idared 
8 | ~< delaye 
a) H as : a 
S K 
4 T= 

S 


d 


At room temperature most electrons are in the Sg singlet ground state. 
The electromagnetic interaction with an incoming particle lifts the elec- 
trons into an excited state somewhere in this spectrum. The singlet ex- 
citations decay very quickly (<10 ps) to the S* first excited state (inter- 
nal degradation), which subsequently does not decay to the electronic 
ground state, but typically to one of the vibrationally excited So levels, 
with the emission of a photon on the scale of ns (‘prompt fluorescence’). i in wavelength is called ‘Stokes 
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Response to excitation 


Re-absorption of the emitted photon is suppressed, as its energy is less 
than the transition from the ground state So + S*. 

While singlet-triplet transitions are forbidden by electric dipole selec- 
tion rules, a small number of molecules will become excited in a triplet 
state. Such states are metastable and the de-excitation to the ground state 
So takes in the order of ms. The metastable state can also be lifted to an 
excited S* state (by thermal excitation or another charge particle colli- 
sion), which then decays promptly as discussed above. This process is 
called ‘delayed fluorescence’. 

To achieve larger macroscopic transmission distances, additional or- 
ganic fluorescing compounds are added, which degrade the photon energy 
further. The first step in this degradation cascade usually happens on such 
small distance scales that the energy is not transferred by photons, but by 
a resonant dipole-dipole interaction (Förster energy transfer). The primary 
scintillator and the wavelength shifting additives are chosen so that the 
emission and absorption energies of the different stages match. 


Ionization excitation of base plastic n 
base plastic 


Forster energy transfer 


primary fluor 
- (~1% wt/wt ) 
emit UV, ~340 nm 


absorb UV photon secondary fluor 
. (~0.05% wt/wt ) 

emit blue, ~400 nm 

lm Yy 


absorb blue photon photodetector 


The prompt decay of the excited state has an exponential decay char- 
acteristics, L œ exp(—t /T). Because organic scintillators involve relaxation 
of electronic states within one molecule the prompt fluorescence emission 
process can be fast (a few ns), and organic scintillators are often used in 
timing applications, like detectors used for triggers and detectors for time- 
of-flight measurements (see section 12.2). If delayed fluorescence occurs, 
another exponential with a longer time constant needs to be added. 

The light output of scintillators does saturate for high excitation densi- 
ties. This effect is described by the empirical Birks’ formula [144] for the 
light yield per unit length of the incoming particle’s path, 


dL A (£) 
de 1+kB (E) 


(5.1) 


where A is a proportionality factor that describes the proportionality of en- 
ergy loss and light yield at low excitation density, k is the probability of 
quenching, and B is a proportionality factor between the energy loss and 
the density of already excited molecules. Together, kB is called the ‘Birks’ 
constant’, which is a property of the scintillator and is found experimen- 
tally. 

For scintillators with more than one decay time constant the different 
components typically have different values of the Birks’ constant, hence 
the signal shape will differ for particles with different energy loss. 

The light yield of organic scintillators is usually fairly stable under 
temperature variations (between 0 and 3 x 10-3 per °C [403]). Gener- 
ally, the light yield from scintillators decreases with temperature, due 
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This is called ‘phosphorescence’. 


These substances are also called ‘wave- 
length shifters’. For a detailed re- 

view of modern scintillator materials, 
see [129]. 


Figure 5.2: Photon energy degradation 
in an organic scintillator [505]. In 
detectors in which the photon sensors 
are more than 1 m away from the 
scintillator, it is common to employ an 
additional wavelength shifter to convert 
the light from blue to green (this has a 
significantly longer attenuation length). 


Lis the light output. 


Only in very fast scintillator applica- 
tions the finite risetime due to excitation 
and degradation can become relevant. 


Birks’ law applies also for inorganic 
scintillators. 


For example, for polystyrene 
kB = 0.126 mm/MevV. 
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Figure 5.3: Variation of the specific 
A fluorescence, dL/ dx, with the spe- 
y NEAR RESEONSE cific ionisation loss, (1/p)dE /dx, 
in anthracene crystals. The solid 
curve represents eq. (5.1) with 

kB =5.3 x 107°? cm/MeV [168]. 
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to thermal quenching, caused by non-radiative de-excitation or thermal 
ionisation of the luminescent centres. In the first case the state is excited 
to a level where the energies of the electronic states of the excited and 
the ground state cross, so that no energy is released in the transition be- 
tween the two, with a subsequent de-excitation of the vibrational excita- 
tion within the ground electronic state. In the second case the excited state 
receives sufficient thermal energy that it becomes ionised before it can Figure 5.4: Non-radiative de-excitation. 
decay radiatively [306, 335]. 

A widely used type of organic scintillator are plastic scintillators. 
There the scintillator molecules get dissolved in a styrene or vinyltoluene 
monomer which is then polymerised to create a solid plastic. Plastic scin- 
tillators are produced by casting or extrusion and can be conveniently 
made in many different shapes and sizes. For use, they are typically 
wrapped in a reflective and light-tight coating. In principle plastic scin- 
tillators are robust, but mechanical stress can cause the development of 
micro-cracks, which cause photon loss. 

Liquid organic scintillators can have good scintillation yields and long 
attenuation lengths. They are thus easy to use and affordable for filling out 
a large active volume, and thus they are an attractive detector medium for 
neutrino experiments (see section 14.2). High purity is required for good 
light yield and contamination with water, oxygen and solvents must be 
avoided. 


5.2 SCINTILLATING FIBRES 


If a good spatial resolution of the scintillation is required, the obvious ap- ane opiera rie of scintil- 
proach is to reduce the size of the scintillator elements. A practical ge- 
ometry for this are scintillating fibres. In principle, diameters down to a 
few 100 um are possible, but because of the trade-off between granularity 
of the position information and light output, often larger diameters (up to 
1 mm) are chosen. Scintillating fibres can be made of appropriately doped 
polystyrene or glass, or they can be liquid scintillator contained in capil- 
laries. 

A benefit of the fibre geometry is that the light can be transferred 
along the fibre by total internal reflection to a junction with a photon de- 
tector. Mechanically, individual fibres are flexible, although this flexibility 
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is reduced once they are collected in bundles. Active (scintillating) fibres 
can be spliced onto passive (light-guiding) fibres. 

To provide a controlled surface for the internal reflection the fibres 
are usually clad with a thin (< 5 um) layer of transparent material with 
a lower index of refraction. Photons with an angle larger than Ogit = 
arcsin (7c1aq/Mcore) to the surface of the interface, where ngjaq and Neore 
are the index of refraction of the cladding and the core, respectively, will 
be transported along the fibre. Theoretically, the fraction of collected light 
emitted on the axis of the fibre in one direction of the fibre is given by 


1 1/2—-Ocy} 1 
f f ‘Qn sin6 dO = 5 (1 arcsin (528) : 
0 


4r "core 


A typical combination is a polystyrene core (core = 1.59) with a cladding 
of poly(methylmethacrylate) (PMMA, ncaa ~ 1.49), and the fraction 

of collected light in this case can be 3%. Double cladding with an addi- 
tional outer low-n cladding layer (for example with a fluoroacrylic with 

n = 1.42) can improve the trapping of the light in the fibre to about 5%. 

A concern for closely packed fibres is cross-talk, where light from one 
fibre is collected and propagated in an adjacent fibre. Usually, photons 
outside of the collection cone of a fibre cannot be collected by a parallel 
fibre. However, if the photon activates a wavelength shifter in the other 
fibre, the isotropically re-emitted shifted photon can be in the acceptance 
cone for propagation. Small-diameter scintillating fibres therefore usually 
contain only one wavelength shifter with a large Stokes shift (for example 
1-pheny1-3-mesityl-2-pyrazoline, PMP) and thus little overlap between 
emission and absorption bands. The drawback is a smaller light yield. An- 
other approach is to deposit a thin layer of aluminium over the surface of 
the fibres. This eliminates cross talk and also increases the light yield. 


5.3. INORGANIC SCINTILLATORS 


Inorganic scintillators are transparent crystals, usually mixed with some 
dopant (for example Nal, doped with TI). The interaction with the incom- 
ing charged particle lifts electrons well above the band gap and leaves 
deep holes in the valence band and below. The high excitations lose en- 
ergy in inelastic collisions, while the holes gain energies from Auger tran- 
sitions, until all charge carriers are contained in the valence and conduc- 
tion bands. 

These states then thermalise to the lower edge of the conduction band 
(electrons) and the top of the valence band (holes). They then form exci- 
tons (electron/hole states just inside the band gap),which travel together 
until they get captured as a whole by impurity centres (‘activators’). These 
usually have been deliberately introduced, creating locations within the 
crystalline structure that are referred to as ‘luminescence centres’ or 
“emission centres’. Finally, the excited activator states decay, emitting a 
visible or UV photon, which can travel long distances as their energy is 
smaller than the band gap. In addition, the wavelength of the light emit- 
ted by the activator is often better matched to the spectral sensitivity of the 
photon detector. 

In some crystals the band structure does inhibit the thermalisation of 
deep hole states, so that there is a significant probability of electrons in 


(5.2) 
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This is beneficial because clear fibres 
have longer attenuation lengths than 
scintillating fibres. 


Figure 5.5: Cross-talk in bundles 

of scintillating fibres (modified 

from [215]). Left: scintillator with two 
wavelength shifters (p-Ter+POPOP). 
Right: scintillator with one wavelength 
shifter (PMP). Both bundles are illumi- 
nated with a UV laser from the right. 
The fibre diameter is about 30 um. 


Inorganic scintillators have been used 
from the early days of studying radioac- 
tivity. It were barium platinocyanide 
plates that allowed Réntgen to make 

his first observations of X-rays in 

1895 [431]. 


Because the process requires move- 
ment over significant distances, crystal 
scintillators are usually not very fast 
detectors. 


For example, in pure Nal photons with 
a wavelength of 303 nm are created. 
The activator Tl changes the photon 
wavelength to 450 nm. 


An example for such a material is BaF. 
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Figure 5.6: Relaxation scheme for electronic excitations in an insulator: e, electrons; h, holes; ph, phonons; hv, photons; Vk, self-trapped holes; STE, 


self-trapped excitons; c" , ionic centres with charge n. Occupation density is represented by grey and white areas for electrons and holes, respectively 
[326]. 


the conduction band falling into these deep holes, resulting in a very fast 
emission of a UV photon. The challenge in exploiting this fast signal is the re _— called ‘cross- 
high energy of the photon, which is usually outside the sensitivity of most l 
photocathodes. 
Not all crystals require doping to achieve scintillation: Bismuth Ger- 
manium Oxide (BGO), for example, is a pure inorganic scintillator with- 
out any activator impurity. There, the scintillation process is due to an op- 
tical transition of the Bi>* ion, a major constituent of the crystal. 
The energy of the emitted photons from inorganic scintillators has a 
broad spectrum because of distortions of the emission centres due to lat- 
tice deformations, and because of temperature broadening of the optical 
transitions. Typically, between 10 and 100 eV are required per photon pro- 
duced, depending on the band gap. Practically, the energy required per de- 
tected photon is larger than that, because of photon loss due to absorption 
and on surfaces, and inefficiencies in the detection. 


140 Figure 5.7: Photon yield/keV of several 
inorganic scintillators as a function of 


the band gap [326]. 
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In the time domain the signal is again described by a decaying expo- 
nential, although the time constants are longer than for organic scintil- 
lators due to the time required for thermalisation of the charge carriers. 
Often crystal scintillators display two time constants: one for fast recombi- 
nation (10~°-10~° s) from activation centres, and the other from delayed 
recombination due to trapping (1073-1076 s). 


Table 5.1: Some common scintillators (data from [505, 316, 512]). 
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we Light yield Peak wavelength Decay time Density dL/dT K 
Scintillator [keV] fan] [ns] [g/cm?] [% : c Hygroscopic 
Nal:Tl 43 410 245 3.67 —0.2 yes 
CsI 1.5/0.5 310 30/6 4.51 —14 slightly 
CsI:Tl 52 550 1220 4.51 0.4 slightly 
LSO:Ce (Luz2SiOs:Ce) 27 402 41 7.40 —0.2 no 
LYSO:Ce (Luy.g Yo.2Si0O5:Ce) 33 420 40 7.10 no 
BGO (Bi4Ge3012) 8.2 480 300 7.13 —0.9 no 
PWO (PbW04) 0.13/0.03 425/420 30/10 8.30 —2.5 no 
BaF 1.8/10 220/310 0.6-0.8/630 4.88 0.1/—1.9 no 
LaBr3:Ce 61 356 17-35 5.29 yes 
Plastic scintillator (typ.) 10 425 2 1.03 0 to —0.3 no 
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Figure 5.8: Material properties of inorganic scintillators [345]. 
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The temperature dependence of the light yield for inorganic scintilla- 
tors is typically larger than for organic scintillators, and is quite significant 
for BGO and PWO (1-2% per °C). In addition to the light yield also the 


decay time of the scintillation can change with temperature. 


300 


100 150 200 
Relative LY (normalized to LYSO:Ce) 


Temperature stabilisation is therefore 


needed for these crystals, if proportion- 
ality of the signal is required. 
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BaF, (220 nm) 
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Another complication is that some crystals are hygroscopic and the 
ingress of water can damage the crystal structure close to the surface, re- 
sulting in dead zones and poorer reflectivity of the surface. 

Inorganic crystal scintillators can be very dense (for example, PoWO4 
has a density of more than 8 g/cm*), which makes them well-suited for 
detectors where high stopping power is required, and in particular for the 
use in homogeneous calorimeters (see section 11.3). 


5.4 SCINTILLATION IN LIQUID NOBLE GASES 


Liquid noble gases can also be scintillators. An incoming particle can cre- 
ate excitons or electron-hole pairs in the liquid. Collisions with neutral 
atoms can then create diatomic excited or ionised molecules. In argon, 
for example, these can be an excited state Ar} or an ionised state Ar; . 
The ionised state can capture an electron, leading also to the neutral ex- 
cited state Ar}. This state decays non-radiatively to an intermediate level, 
which then decays radiatively to the ground state, emitting a scintillation 
photon in the process. For liquid argon, the wavelength of this photon is 
centred around 127 nm. This is far in the UV region, beyond the sensi- 
tivity of standard photon detectors, and thus shifting of the wavelength is 
usually required. A suitable wavelength shifter is tetraphenyl-butadiene 
(TPB) [172]. 

The liquid noble gas is transparent for its own scintillation light, be- 
cause the typical distance between atoms in the liquid is about a factor 2 
larger than the separation of the two atoms in the excimer, and thus the 
reverse of the emission process is highly unlikely. 

The excimers can occur in singlet and triplet states, which have similar 
energies but very different decay lifetimes, resulting in two time constants, 
The lifetime for the singlet states is short (a few ns), but the lifetime for 
the triplet states can be considerably longer (up to 1 us). The excitation 
produces approximately 40 x 103 y/MeV in liquid argon. 


5.5 PHOTON DETECTION AND COUPLING 


The actual number of detected photons from a scintillation detector is 
given by the light yield of the scintillator times the light collection ef- 
ficiency and the quantum efficiency of the photon detector. The light 
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Figure 5.9: Light yield as a function 
of temperature for some inorganic 
scintillators (for BaF only the 220 nm 
emission is shown) [367]. The curves 
are normalised to the light yield at 

20 °C for each material. 


These excited states are sometimes 
called ‘excited dimers’, or ‘excimers’. 


For liquid krypton the wavelength is 
about 150 nm, and for liquid xenon 
about 178 nm. 


This argument is only valid for pure 
argon. Contaminants can reduce light 
transmission. 


46 x 10° y/MeV in liquid xenon. 
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collection efficiency is affected by the attenuation of the light within the 
scintillator and light guides due to finite transparency, as well as losses 
incurred during the total internal reflection on the inner walls of light- 
guiding elements. 

In practical applications the scintillator is connected optically to the 
photon detector via a light guide, in which the light is conducted by inter- 
nal reflection [506]. To couple efficiently, Liouville’s theorem (Liouville’s 
theorem demands that the volume in phase space stays the same) needs 
to be satisfied, which means that the cross-section of the light conduc- 
tor should stay constant up to the photon detector. This target can lead to 
creative solutions for light guide geometries. If it cannot be achieved, the 
collection efficiency will be reduced by a factor up to App /As, where App 
and As are the photon detector and scintillator facing cross-sections, re- 
spectively. Large and complex scintillator and light guide geometries can 
lead to significant path length differences for the photons, degrading the 
timing resolution of the system (c ~ 20 cm/ns in the material). 

One approach to achieve reduction of aperture in a light guiding sys- 
tem is to couple the (large-area) scintillator via a small air gap to a smaller 
light guide, which is doped with a wavelength shifter (typically from blue 
to green). The re-emission from the wavelength shifter is isotropic, but 
with the smaller phase space due to the smaller dimensions of the light 
guide. 

Good coupling of the light guide to the photon detector is achieved 
with optical grease or optical pads. In modern scintillator systems, the 
light transfer is often via optical fibres that are doped with wavelength 
shifters. The benefit of such a light transfer is a high mechanical flexibil- 
ity of the fibres, that the optical connections can be distributed more uni- 
formly within the scintillator, and that the optical path length can be more 
uniform. 

Photon detectors commonly used are Photomultiplier tubes (PMTs) 
for large segmentation, and Avalanche Photodiodes (APDs) for small seg- 
mentation. Typical efficiencies for PMTs are about 30%, and for APDs 
up to 80%. Other, more recently developed photon detectors are Silicon 
Photomultipliers (SiPMs) or Multi-Pixel Photon Counters (MPPCs). 

Ideally, the peak emission energy of the scintillator matches the peak 
quantum efficiency of the photodetectors used, typically 250-500 nm for 
photomultipliers (see also Figure 4.14) and 450-900 nm for solid state 
photodetectors (p-i-n diodes and avalanche photodiodes). 


5.6 RADIATION DAMAGE IN SCINTILLATORS 


Radiation can affect scintillators in several ways: First and foremost, it 
can cause radiation-induced absorption by creating colour centres that trap 
photons in the scintillator and thus reduce the attenuation length. Second, 
it can introduce radiation-induced phosphorescence (afterglow), and fi- 
nally, in organic scintillators it can damage the scintillation mechanism, 
degrading the light yield and changing the emission (and the absorption) 
spectrum. 

In organic scintillators radiation can create free radicals, which 
strongly absorb UV photons. The radicals usually will anneal with time, 


but can polymerise with other radicals, in which case the radiation damage 
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Surface imperfections can result in 
losses even for angles that would cor- 
respond to total internal reflection for a 
perfect surface. 
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Figure 5.10: Light guide geometries. 
Top: Adiabatic light guide. Bottom: 
Fishtail light guide. 


Optical grease is usually a transpar- 
ent silicone grease with an index of 
refraction close to plastic scintillators 
(n = 1.465). Optical pads are soft sili- 
cone pads with a few mm thickness with 
a similar index of refraction. 


See section 4.4. 


See section 9.9. 


For a discussion of these devices, to- 
gether with their strengths and weak- 
nesses, see section 9.9. 


Other photodiodes using InGaAs or 
InGaAsP as used in telecommunications 
have sensitivity up to 1550 nm. 
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becomes permanent. Diffusion of oxygen into plastic scintillator during 
the annealing leads to the creation of carbonyl and hydroxyl groups, which 
absorb at longer wavelengths, resulting in reduced transmission. In plas- 
tic scintillators it is the base material that gets damaged, the dopants are 
usually stable [169]. 

In halide crystals the dominant damage mode is again oxy- 
gen/hydroxyl contamination. In oxide crystals it is structure defects, such 
as oxygen vacancies [512]. 

The radiation hardness can also sometimes be improved with suitable 
doping. For example, in PoWOx, doping with niobium prevents the trap- 
ping of holes on oxygen near a lead vacancy [273]. 

Recovery of the light yield due to annealing occurs already at room or 
application temperature, but the radiation damage may also be reduced by 
heating the crystals (thermal annealing) or by exposing the crystal to light 
(optical bleaching). As a consequence of the annealing the radiation dam- 
age effects in most crystals are rate dependent. Under normal operation 
there is significant annealing such that for a given dose rate an asymptotic 
equilibrium between damage and annealing is obtained. 

Radiation damage is less of a concern in liquid scintillators, as the 
medium usually can be exchanged, and contaminants need to be removed 
in any case, whether they are generated by radiation or by other sources. 


5.7 COMPARISON OF SCINTILLATION AND CHERENKOV 
RADIATION 


Both Cherenkov radiation and scintillation light comprise photons in the 
visible range, and they can be recorded by similar photon detectors. How- 
ever, they are fundamentally different. Cherenkov radiation is directly A material often used similarly to in- 


k £ i : organic crystal scintillators is lead 
generated by the incoming charged particle, and thus the emitted photons glass (SiO3/PbO), for example in ho- 


correlate with the direction and the velocity of the incoming particle. The — geneous calorimeters. However, the 
2 , % r 7 photons created there are not from scin- 

relevant property of the detection medium is the index of refraction. The tillation, but from Cherenkov radiation. 

photons from Cherenkov radiation are created instantly when the incom-  *s light output is two to three times 


x i 3 lower than crystal scintillators. 
ing particle traverses the detector material. 


As we have seen in chapter 2, excitations are a prominent manifesta- 
tion of charged particle energy loss, whereas Cherenkov radiation, when 
the conditions for its creation are met, constitutes only a small contribu- 
tion (~10~3). 

Scintillation light is coming from a secondary emission from the relax- 
ation of excited states in the medium. It is thus isotropic, and the emission 
of scintillation light, in particular from inorganic scintillators, is delayed. 
The energy of the scintillation photons is governed by the energy levels 
in the molecules or lattice of the scintillator. Optimisation is therefore a 
matter of chemistry. 

However, the two signals can occur concurrently in response to the 
passage of a charged particle in a scintillator, and if they do they are some- 
what complementary, which can be exploited in detectors with dual read- 
out (see section 11.11). 
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Key lessons from this chapter 


e The key feature of a scintillation detector material is the shifting of the energy of 
the emitted photon from the energy associated with the energy gap involved in the 
photon creation (usually in the UV) to longer wavelengths. This is necessary to obtain 
photons that can traverse macroscopic distances in the scintillator, and to match the 
spectral sensitivity of the photon detector. 


This is usually achieved by doping. 


In organic scintillators this is achieved by doping with wavelength shifters. 


In inorganic scintillators the doping creates luminescence centres. 


In general, organic scintillators are faster, cheaper and are less dependent on tempera- 
ture, and inorganic scintillators are denser and thus have a higher stopping power. 


e The properties of different inorganic scintillators vary significantly. A fast, radiation- 
tolerant inorganic scintillator, PhbWO 4, was developed for use at LHC. 


As usual, there is no universal solution. The choice of technology is driven by the 
specific requirements of the experiment. 


EXERCISES 


1. Collection time for scintillation light. 


How much time is required to collect 90% of the total light yield for a) NaI:T1, b) PoWO;, c) a 


typical plastic scintillator? 


2. Total internal reflection. 


A scintillator is made of a slab of polystyrene (n = 1.59) of infinite area. Calculate the fraction of 
the light that escapes from either surface of the slab, and hence the fraction of the light that will be 


trapped in the scintillator. 


3. Scintillating fibres. 


A minimum-ionising particle is passing through a scintillating fibre with a doped diam- 
eter of 300 um. (The stopping power for a minimum ionising particle in polystyrene is 
1.936 MeV g7 !cm?.) 


a) Estimate the deposited energy and the number of scintillation photons created in the fibre. 


(Peore = 1.05 g/cm?). 


b) Estimate the number of photons that will arrive at the end of a 1 m long scintillating fibre, 
if the index of refraction of the core and the cladding is 1.59 and 1.48, respectively, and the 


attenuation length is 2 m. 


6 Detection of ionisation 
without charge movement 


So far we have discussed particle detectors that ultimately produce an 
electrical signal from the collection of charges. However, there are detec- 
tors in which the ionisation charges produced by the passage of a charged 
particle cause a local change to the molecular structure of a photographic 
emulsion, or a local change to the physical state of the detector medium, 
resulting ultimately in an image that reflects the path of the incoming par- 
ticle in the position of these localised changes. Interpretation of these im- 
ages requires optical survey that produces the data that is then used in the 
analysis. 

Detectors of this type have been historically very important, but today 
they have mostly been superseded with detectors with electronic read- 
out, which are faster, can cope with high particle rates, and can easily be 
incorporated in electronic triggers. Nevertheless, these technologies still 
do provide some of the most accurate position measurements for charged 
particles available, and thus there are even today examples of these tech- 
nologies used in experiments that do not involve high detection rates. 


6.1 EMULSIONS 


Nuclear emulsions, like conventional photographic film, consist of sil- 
ver halide crystals (mostly bromide with a small admixture of iodide) im- For a more thorough discussion of 
ý J i : j . the chemical processes in emulsions 

mersed in a gelatinous or polymer carrier. The size of the crystals is typi- see [67]. 
cally between 0.1 um and 1 um. When a halogen atom in the crystal gets 
ionised, the electron can combine with a silver ion resulting in a neutral 
silver atom. Several silver atoms then coalesce and form a small cluster. At this point we speak of a ‘latent 
Exposure to a development solution will make the silver clusters grow, aT 
until they cover a macroscopic area that can be resolved by eye. In a final 
development step another chemical, the fixer, removes the remaining silver 
halide crystals, but leaves the silver clusters in place. After a final wash to 
remove all chemicals the pattern of silver clusters representing the image 
is permanent. The gelatinous carrier maintains the position of the silver 
clusters throughout the process. 

The average energy loss of a charged particle is insufficient to develop 
a silver halide grain, but ionisation events with a large energy transfer 
(6-electrons) are usually contained within a grain and deposit all their en- 
ergy there, which is sufficient to start the chemical process. Because of the 
statistics of these high-energy transfers, only a fraction of the grains along 
the path will get developed. To address this, nuclear emulsions differ from 
regular photographic film, in that they contain a higher, more uniform den- 
sity of smaller silver halide grains, and are thicker. The grain density is 
about 1014 grains/cm?, and each grain is essentially an independent detect- 
ing element. 

The spatial resolution of a single grain can be described by the 
RMS of the response distribution, which can be idealised as a binary 
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distribution that is 1 for a spherical volume of the grain with diame- 

ter D, and 0 outside. It is found similarly as eq. (3.7), and is given by 
RMS = \/xD/8. With a grain diameter of 0.2 um, this results in a the- 
oretical spatial resolution of 44 nm. The availability of a large density of 
high-precision space points is the strength of emulsion detectors. 

Nuclear emulsions do display random noise, which is due to thermal 
noise, gelatine impurity and over-sensitisation. In addition, as there is usu- 
ally a delay between the manufacture of the emulsion and its use in an 
experiment, there will usually also be a number of background tracks from 
that period. However, latent images disappear with time due to oxidation 
(a process called ‘fading’). This process accelerates at high temperature 
and humidity, and by the addition of 5-methylbenzotriazole the disap- 
pearance of background tracks can be accelerated without affecting the 
sensitivity of the material [385]. 

Photographic emulsions have been used historically in particle physics 
(the first observation of subatomic particles has been made by Becquerel 
by accidentally exposing photographic plates to beta radiation from Ura- 
nium salts [124]). Today they are still in use where high position resolu- 
tion for low charged particle densities and rates are needed (i.e. neutrino 
experiments). 


Emulsion detectors have been developed into very high resolution 
tracking detectors with three-dimensional reconstruction capabilities by 
arranging emulsion layers into stacks, which are usually interleaved with 
passive material layers, like plastic or metal plates. Multi-layer stacks of 
emulsions are often employed as high-resolution active targets together 
with electronic detectors (scintillation counters, trackers or calorimeters), 
which give information to localise interesting vertices in the emulsions 
and complement the kinematic information of the events. An intermediate 
(double-)emulsion layer (referred to as a ‘changeable sheet’, CS) can be 
used to make a decision on whether a section of the ECC should be un- 
packed and developed. 

The ECC target for the OPERA experiment was made of films with 
a total surface of 110,000 m? and 105,000 m? lead plates (target mass of 
1.25 kt), relying on industrially produced, machine-coated emulsion films 
with very uniform layer thickness [385]. 
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This operation is called ‘refreshing’. 
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Figure 6.1: Image of one of Becquerel’s 
photographic plate after exposure to B 
radiation from uranium salts [125]. 


Figure 6.2: Image of a neutrino vertex 
in the emulsion target of the CHORUS 
experiment [232]. The image size 
corresponds to about 100 x 100 um? 
(this is the typical scale of the size of 
one pixel in a silicon pixel detector). 


Such an arrangement is called an 
emulsion cloud chamber (ECC). 
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The other important technological development allowing for the ex- 
ploitation of such large emulsion detectors is the development of auto- 
mated scanning microscopes. Typically, the focal length of these micro- 
scopes is about 5—10% of the thickness of the emulsion, allowing for an 
automated recording of the three-dimensional track within the emulsion. 
The scanning of the events in the OPERA experiment was performed with 
40 fully automated microscopes which allowed to scan about 500 m? of 
emulsion in 5 years. 

Among all tracking devices used in particle physics, nuclear emulsion 
particle detectors feature the highest position and angular resolution in 
the measurement of tracks of ionising particles, in particular for the detec- 
tion of short-lived particles. They are particularly useful when very high 
resolution is needed, but at low speed and cost (for example in T neutrino 
experiments). 


6.2 CLOUD AND BUBBLE CHAMBERS 


Another class of detectors that historically has been very important for 
particle physics relies on local phase changes caused by the passage of a 
charged particle in a homogeneous detector medium that has been pre- 
pared in a metastable state (supercooled vapour or superheated liquid). 
The pattern of generated droplets or bubbles are then recorded using pho- 
tography and analysed after scanning of the image. 


BUBBLE CHAMBERS 


A superheated liquid is unstable and will tend to minimise its Gibbs po- 
tential by evaporation. However, this process is resisted by the surface 
tension of the emerging vapour bubbles. For a bubble to grow it must have 
a minimum size given by the critical radius re = 2y(T)/AP, where y(T) 

is the surface tension at temperature T and AP is the pressure difference 
between the pressure of the vapour inside the drop and the pressure in the 
liquid. Therefore, to create a bubble locally a spike of energy deposited in 
the liquid must occur, which is sufficient to instantly create a bubble with 
a radius larger than the critical radius. Theoretically the energy required 


for this is between a few eV in helium to more than 100 eV in propane. The 


energy actually required is significantly higher and found from counting 
of bubbles, to be 0.5—1 keV, and thus requires a high-energy ionisation 
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Figure 6.3: Schematic drawing of an 
ECC detector (OPERA) [232]. 


For example, the discovery of strange 
particles in a cloud chamber [429], 

the confirmation of the quark model 

by the observation of the Q` particle 
in a bubble chamber [114], or the 
discovery of the neutral weak current in 
the Gargamelle bubble chamber [282]. 


See problem 1. 


The energy must be deposited within a 
volume given by the critical radius. 
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event (6-electron) from the passage of a charged particle through the liq- 
uid [452]. 

One side effect of the bubble formation is the creation of a sound wave 
in the liquid. This sound can be picked up with microphones, and from 
early on has been used to trigger the camera systems that take the photo- 
graphic images of the bubbles in the chamber medium. 

The main challenge with bubble chambers is that the superheated state 
is very fragile. Nucleate boiling commences instantly at any contamina- 
tion and in particular at any surface feature of the containing vessel, con- 
ditions which locally reduce the threshold radius for bubble growth. This 
was an acceptable situation in accelerator-based bubble chamber exper- 
iments, as the active time did not have to exceed the duration of a bunch 
train from the accelerator (fractions of a second). In this case the cycling 
frequency of the bubble chamber was synchronised with the beam struc- 
ture, typically up to a few Hz. 

An example for the large last-generation bubble chamber experiments 
is the Big European Bubble Chamber (BEBC) at CERN [279]. This cham- 
ber was designed to operate with 35 m° of hydrogen, deuterium or oxygen 
as active liquid at temperatures between 25 and 36 K. Events in the ac- 
tive volume of 3.7 m diameter and 2 m height were recorded by four wide 
angle cameras on top of the experiment. The double-walled pressure ves- 
sel was designed for a pressure of 10 bar. Superheating was achieved by 
rapidly withdrawing a piston with 180 cm diameter and a mass of 1.8 t 
by 10 cm (an expansion of 0.7%,). The maximum duration of an expan- 
sion/compression cycle was 45 ms, and the maximum acceleration 200 g. 
To allow for a momentum measurement of charged tracks in the chamber, 
the experiment was placed between two superconducting Helmholtz coils, 
which produced a vertical field of 3.5 T in the tracking volume. 


Camera mount & 
feedthrough 
ï 


4 


Outer vacuum vessel 


hn wal 


Camera windows 


105 


At the time this was the largest super- 
conducting magnet system in the world. 
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Figure 6.4: Big European Bubble Chamber (BEBC). Left: Cross-section of the detector (modified from [171]). Middle and right: Event display with 


production of a D meson in neutrino interactions [170]. 


For the analysis the photographic images created in these bubble 
chambers had to be scanned by semi-automated scanning machines un- 
der control of human operators and the kinematic parameters of the tracks 
extracted. To obtain a full three-dimensional reconstruction several images 


These large bubble chamber experi- 
ments also were important in the devel- 
opment of organisational patterns that 
are followed also today: Large, com- 
plex and technologically challenging 
experiments at a central facility, with 
reconstruction and analysis performed 
at collaborating institutes. 
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from different angles were taken. A major limitation of the photographic 
imaging is the limited focal length of the cameras. To overcome this, the 
conventional photographic images have been supplemented with holo- 
graphic images as they ideally have infinite focal length [287]. 

A modern application for bubble chambers is in searches for WIMPs. 
In these experiments there is no correlation with an accelerator duty cy- 
cle, and the goal is to maximise the time during which the chamber stays 
active. Hence, the detectors are optimised to delay spurious boiling, in par- 
ticular on surfaces. This is achieved by enclosing the working fluid (for 
example, CF3I) in a container of synthetic fused silica with high surface 
smoothness. One approach to prevent the working fluid from contacting 
any remaining metallic parts in the drive mechanism (piston) was to cover 
the working fluid with a layer of ultra-pure water as a buffer medium. 
With such a setup the chamber can be kept in a stable superheated state 
for several hundred seconds. 

More recent versions of this experiment have eliminated the layer of 
water because of the acoustic signatures this layer created. The separation 
of the superheated working fluid is now achieved by a temperature gradi- 
ent in the fluid, where the part of the liquid in contact with metallic parts is 
kept cold below the saturation point [165]. 

A dark matter signal would be observed in such an experiment as a nu- 
clear recoil from a neutral particle. Such an interaction leaves only a single 
bubble at its interaction point, differently than the tracking bubble cham- 
bers used historically, which recorded the tracks from charged particles. 
Bubble chambers employed in the search or WIMPs are operated at com- 
binations of temperature and pressure that make the detector insensitive to 
minimum-ionising radiation. 

Another way to avoid the contact of the superheated liquid with any 
solid surface is by dispersing droplets of the working liquid in an insol- 
uble gel or viscoelastic medium. Such a detector is called a Superheated 
Drop Detector (SDD) [61]. The energy required to make a bubble grow 
when a charged particle is passing is mechanical energy stored in the liq- 
uid, hence this detector does not need any power, once its pressure is set. 
Hence SDDs can be used for dosimetry, in particular for neutrons (“bubble 
detector’) [436]. 


CLOUD CHAMBERS 


A cloud chamber is filled with a gas (typically air), which is mixed with 
another substance in the saturated vapour state. If the temperature is low- 
ered, the vapour will become super-saturated and condensation com- 
mences [218]. However, due to the surface tension, the saturation vapour 
pressure over a curved surface with radius r is larger than over a planar 
surface and given by 


Psvp(7) _2y 1 
oo E 


where y is the surface tension, R the gas constant, n the molar density and 
T the absolute temperature. If the droplet has a radius less than 2y/ (nRT) 
(for water this is about 1 nm), the saturation vapour pressure above the 
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A recent example is the PICO experi- 
ment [56]. 
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droplet is significantly higher than for a flat surface, and thus for reason- 
able levels of supersaturation such a small droplet would immediately 
evaporate, rather than grow. However, if the droplet contains an electric 
charge e, for example from the ionisation due to the passage of a charged 
particle, eq. (6.1) is modified due to the electrostatic repulsion of the 


charge density, 
2 
i ( Psvp(r) ) _ 1 (2 e ) l 
Psvp() nRT \ r 32m €qr4 

The potential energy of the drop increases as it grows in size due to the 
surface tension, but at the same time it decreases due to the electrostatic 
repulsion. The result is that the supersaturation required to make a small 
charged droplet grow is significantly reduced, which allows small droplets 
to form. If the supersaturation increases, these droplets can grow, until a 
critical radius rc is reached. Beyond this level of supersaturation (for water 
S ~ 4, corresponding to an expansion ratio of 1.25) the droplet dimension 
becomes unstable and the droplet grows rapidly to visible size. This is the 
principle of the cloud chamber. 

Two types of cloud chambers exist: In expansion chambers (also called 
“Wilson chambers’), sub-cooling is achieved by lowering the pressure in 
the active volume of the detector. The vapour expands adiabatically and 
thus cools down, until it becomes super-saturated. The chamber is sensi- 
tive until the new equilibrium is reached. The main drawback of expansion 
cloud chambers is the long time such a chamber usually needs until the 
original equilibrium is reinstated after a complete expansion/compression 
cycle. A further improvement to the Wilson cloud chamber was the use of 
Geiger-Miiller counters to trigger the expansion of the chamber and the 
operation of the cameras photographing it [146]. 

In diffusion cloud chambers there is a vertical temperature gradient, 
and the warm vapour either rises, or, more commonly, falls towards the 
cold region of the chamber, where it becomes super-saturated [388]. A dif- 
fusion cloud chamber can only be operated stably if the total density of the 
gas-vapour mixture increases towards the bottom to prevent convection. 
Diffusion cloud chambers usually use alcohol instead of water vapour, be- 
cause of its lower freezing point. The main advantage of diffusion cham- 
bers is that they are continuously sensitive (as long as the required temper- 
ature gradient can be maintained), but the vertical extent of the sensitive 
area is limited. 

Today, cloud chambers have little scientific value, but they are often 
used for demonstration purposes, because of their continuous sensitivity, 
the absence of the need for electrical power, and the aesthetically pleasing 
and intuitive presentation of the charged particle tracks. 

However, a large cloud chamber experiment with a volume of 26.3 m’, 
CLOUD, is currently in operation at CERN, to replicate and study at- 
mospheric cloud formation. The contribution of ionising cosmic rays to 
droplet growth is part of these studies [307]. 


(6.2) 
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Figure 6.5: Ratio of saturation vapour 

pressure over a drop and saturation 
vapour pressure over a plane as a 

function of the droplet radius for water 
vapour (data from [218]). 


C.T.R. Wilson developed the expansion 
chamber in 1911, and with it was the 
first to make individual charged particle 
tracks visible [499]. For this work he 
was awarded the 1927 Nobel Prize in 
Physics. 


The Wilson cloud chamber was also the 
first detector used in human radiotracer 
studies that measured the transit time of 
a radioactive tracer in blood from one 
arm to the opposite arm in the human 
body [149]. 


See for example [502]. 
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Key lessons from this chapter 


e The energy transfer from a passing charged particle can create local changes to the 
chemical or physical state of a suitably prepared detector medium. The location of 
these changes reflects the trajectory of the incoming particle. 


e The local changes can be captured by optical imaging technologies, with subsequent 
manual or automatic scanning of the images for extraction of the kinematic proper- 
ties. 


e Usually, the energy required to achieve the change of state is large and requires en- 
ergy transfer in the tail of the charged particle energy loss distribution (6-electrons). 


e These type of detectors have been historically important, but have still applications in 
neutrino physics and dark matter searches. 


EXERCISES 
1. Bubble growth in a superheated liquid. 


The surface tension of the liquid in a bubble chamber at temperature T is y(T). 


a) Show that a spherical bubble of radius r possesses an effective surface pressure 2y(T )/r. 
Hence show that a stable bubble will have a radius r. = 2y(T)/Ap, where Ap is the pressure 
difference between the pressure of the vapour inside the drop and the pressure in the liquid. 


b) The Gibbs free energy G to form a drop of radius r can be expressed as the difference of a 
surface term and a volume term. Find these terms and their dependence on r, and make a 
sketch of G as a function of r. At what value of r is the Gibbs free energy at a maximum? 
What will happen if the radius varies from this value? 


c) Show that the energy required to form a bubble with a radius re is 


16r% (T) 
~— 3(Ap}? 


2. Supersaturation due to expansion. 


In a cloud chamber of Volume V; just before expansion, there is a non-condensible gas and vapour 
at a pressure pı at temperature Tı, with a mass M; in the volume and a molar mass m. 


a) First assume that just before expansion, there is non-condensible gas at a pressure pg, and va- 
por at a pressure pı contained in a volume V; at a temperature Tı. Find the equation of state 
of the vapour, if M; is the total mass of vapour present in the volume V; and M the molecular 
weight of the vapour. 


b) Find the temperature T, and thus the equation of state after an adiabatic expansion to a vol- 
ume V (before any condensation takes place). 


c) However, this is an unstable state since at the lower temperature T, the amount of vapour that 
may be held in suspension is less than at the temperature 7; and some of the vapour will con- 
dense, reducing the mass in the vapour from M, to M2. After the condensation equilibrium is 
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d) 


again reached and the vapour pressure falls to the saturation pressure p2 at the lower temper- 
ature 77. The equilibrium temperature T is slightly higher than 75, the temperature immedi- 
ately after expansion, as some heat is liberated by condensation of vapour. Find the equation 
of state in the new equilibrium state. 


The supersaturation S is the ratio of the density of vapour after expansion but before con- 
densation, ps = Mı /Və, to the saturation density at the lower temperature, p2 = M2 /V2, 
S = p} / p2. Show that, if the slight difference between T) and Ty is neglected, 


1 Y 
s=2 (37s) ? 
p2 \1+E 


where 1+ € = V2/V; is the expansion ratio, and y = cp/cy, the ratio of the heat capacities at 
constant pressure and volume, respectively. 


7 Gaseous detectors 


When a charged particle passes through a gas, it will create ionisation 

charges as described in section 2.3. If an electrical field is applied, the 
charges will separate and drift towards the electrodes (see section 4.1), 
which induces electrical signals on the electrodes as discussed in sec- 

tion 3.1. 

Because of the low density of gases, the amount of charge produced in 
a given path length is small, so that the charge typically needs to be col- 
lected over a larger volume, and internal amplification (see section 4.5) 
is usually required for gaseous detectors. On the other hand, the low den- 
sity has the advantage that multiple scattering is less of a concern than in 
denser detector media. Gases do require containers, but there is flexibility 
in the shape of these containers as is in their size, and gases will readily 
fill out these shapes. Gases can also be easily replaced, so contamination 
and radiation damage effects can be reduced, as long as the damage prod- 
ucts are not solidifying inside the detectors (see section 7.7). 

In this section we will first discuss typical compositions of detector 
gases, and then discuss common geometries, starting with a simple planar 
geometry. The most common gaseous detector is the wire chamber, where 
the high fields required for internal amplification are produced close to a 
thin wire. The position resolution of simple wire chambers is given by the 
cell size and micro-pattern gas detectors (MPGDs) have been developed to 
reduce these. 

A further refinement of the position information can be obtained when 
the finite time it takes the primary electrons to drift to the anode is taken 
into account, leading to the concept of the ‘drift chamber’. Drift chambers 
are today the best tool for large area/volume, low-mass and affordable 
tracking detectors. 

Finally, we will discuss the issues arising from the operation with high 
rates over extended times, and software that allows to predict and optimise 
the performance of gaseous detectors. 


7.1 GASES FOR DETECTORS 


To satisfy the different performance requirements for gaseous detectors it 
is common to use mixtures of gases. Typical components in these mixtures 
and the motivation for their use are: 


e The main component is usually a heavy noble gas, because of the 
high charge yield and electron mobility they provide. Argon is the 
most affordable of these, thus is the most widely used main compo- 
nent of detector gases. 


e In applications where a large primary ionisation density, and thus a 
large mass density, is required, heavy halocarbons are used as the 
main gas component, as they are dense and inert. An example is 
1,1,1,2-tetrafluoroethane (C2F4H2). 


DOI: 10.1201/9781003287674-7 


Halocarbons are compounds in which 
one or more carbon atoms are linked 
by covalent bonds with one or more 
halogen atoms. 


Figure 7.1: 1,1,1,2-tetrafluoroethane. 
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e A quencher is usually required to prevent photon feedback and dis- 
charges in the detector (see section 4.5). Good quencher gases (for 
example CH; or i-C4Hj9) are often flammable, non-flammable alter- 
natives are CO% or CF4. 


Isobutane (i-C4Ho) is an isomer of 


butane. H H H 


H-6. -GH 
e Cold gases like CO, might be used if reduction of diffusion is de- H È H 
sired. Dimethylether (DME) is another gas if low diffusion and low E H 


drift velocity 1S required. Figure 7.2: Isobutane. 


e CF, is also added sometimes because it can remove surface contam- H H 
pa fs : \ / 
ination from polymerisation of the detector gas (see section 7.7). Pe cae “C.H 

\ l 
H 


e If the detector should be sensitive for X-rays, then the gas should 
contain atoms with high Z, like for example xenon. If optical or 
near-UV photons need to be detected (for example in RICH detec- 
tors), TEA or TMP can be added to the gas (see section 7.6). 


Figure 7.3: Dimethylether. 


All components are usually non-polar molecules, to avoid loss of electrons 
due to attachment, and they are chosen to be chemically inert and stable, 
and not strongly affected by radiation. Further concerns for the applica- 
tion of a specific gas can be safety (flammability), and the global warming 
potential (which is in particular a concern for halocarbons). 

While in principle more than two components in the mixture are possi- 
ble and tertiary mixtures are commonly used, control of the mixing ratios 
and thus achievement of stable drift and amplification properties becomes 
more challenging if more gases are used in the mixture. 


Gas Density atSTP (dE/dx) mip Er Welion Np Metal Table 7.1: Sas of typical detector 
[10-3 g/cm] [kV/cm] [W] [V] [em] [em] gaves at NTT (AE d2) a Wen 

power for minimum-ionising particle, 

He 0.18 0.32 24.6 41.3 3.5 8 Er: ionisation energy, We/jon average 

Ne 0.90 1.45 21.6 37 13 40 energy per electron/ion pair, Ny number 

Ar 1.78 2.53 15.7 26 25 97 of primary clusters (see section 2.5) 

Xe 5.89 6.87 121 2 41 312 arie. ee TAFAL IRET OJ eee 
pairs, for unit charge minimum-ionising 

CH4 0.72 1.61 12.6 30 28 54 particles (from [505, 443]). 

C2H6 1.28 2.91 11.5 26 48 112 

i-C4H10 2.51 5.67 10.6 26 90 220 

CO2 1.96 3.35 13.8 34 35 100 

CF4 3.78 6.38 16.0 54 63 120 

C2F4H2 4.25 7.52 12.2 82 


7.2 RESISTIVE PLATE CHAMBERS 


The simplest charge collection geometry is a parallel plate capacitor. How- 
ever, the primary ionisation collected in a gas gap of reasonable thick- 


ness would be too small for direct detection, and thus avalanche ampli- ‘Too small’ here means not distin- 

; . y . 5 : coy . i guishable from the input noise of an 
fication is required. In a practical design to achieve stability against dis- electronic amplifier. 
charge one or both of the HV electrodes are made of resistive material, 
so that the field collapses locally, once significant currents are generated Such a detector is called a ‘Resistive 
by the avalanche. In that case the field decays exponentially with a time Rie CCE TEED 


constant T = P£;€o, where p is the volume resistivity of the material (see st RPC will be blind locally for this 
` 7 . n 2 ‘ime. 

exercise 6). Topica paie materials are glass (with a typical volume re- Bakehte isa hermozering phenol 

sistivity of p = 104 Qcm and £, ~ 2.25 resulting in Tœ 1 s), or Bakelite formaldehyde resin, and was the first 


: Pi 10 ha ; ; Pa type of plastic made from synthetic 
(with P= 10 Qcm and Zia 10, which gives T= 10 ms). components (invented in 1907). 
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Typically, the main gas component in RPCs is 1,1,1,2-tetrafluoroethane 
(C2F4H2), with a small fraction of i-C4Hj9 as a quencher gas. RPCs are 
operated in avalanche mode or in streamer mode. In the latter case the 
output signals are much larger (between 10s of pC to a few nC), at the 
cost of rate capability (a few 100 Hz/cm) compared to avalanche mode 
RPCs, which can operate at rates a factor 10 higher. The addition of small 
amounts of SF¢, which has a high electron affinity, in particular for low 
electron energies, suppresses the development of streamers [175]. 

RPCs are fast detectors, as the detectable signal is generated by 
avalanche amplification immediately after the primary ionisation charges 
are liberated by the incoming charged particle. Hence, in large particle 
physics experiments RPCs are used in two main roles, as timing detectors 
or as trigger detectors, and there are two main flavours of RPC designs to 
satisfy the respective requirements. 

In trigger detectors the main goal is high efficiency. For this, they 
have larger gaps (typically a very few mm), with electric fields of a few 
ten kV/cm. A commonly used gas is C2F4H2/i-C4H10/SF6 96.7/3/0.3. Sin- 
gle and double gap geometries are being used. Efficiencies of 99% and 
time resolutions of the order of | ns are typically achieved. 

For timing applications, the gas gap of the RPC is small 
(250-300 um) with very high fields (around 100 kV/cm). The high field 
demands extremely good surface quality of the resistive plates, leading 
to the choice of glass for the resistive electrodes, and also a gas mixture 
that is more heavily quenched and limiting streamer development (a com- 
mon gas mixture is CyF4H»/i-C4Hj0/SF¢ 90/5/5). In a single gap the time 
resolution is limited by long tails in the time distribution caused by the 
stochastic fluctuations at the start of the avalanche process. For a time 
measurement that is robust against these tails, multi-gap geometries with 
typically five layers are used. The glass plates are stacked within conduc- 
tive electrodes. No conductive connections to the intermediate plates are 
required, as they acquire the correct potential by electrostatic charge di- 
vision. The time resolution achieved in such detectors is around 100 ps. 
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Tetrafluoroethane is used in the refrig- 
eration industry, where it is known as 
R-134a. 


See section 4.5. 


SFs is industrially used as electrical 
insulator and arc suppressant. It is also 
a potent greenhouse gas. 


Detectors with good time resolution 
can be used for time-of-flight (TOF) 
measurements (see section 12.2). 


The time resolution in RPCs is caused 
by statistical fluctuations of the 
avalanche growth and is proportional 
to the effective Townsend coefficient 
a — n (see section 4.5), and the drift 
velocity [425]. 


About the time it takes light to cross the 
detector. 
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Figure 7.4: RPC geometries. Left: Typical trigger RPC [425]. Right: Typical multi-gap timing RPC (after [36]). 


7.3 WIRE CHAMBERS 


The classic geometry for gaseous detectors is the ‘wire chamber’, where 
the amplification field is generated close to thin anode wires (E œ 1 /r). 
The electrons from the primary ionisation first drift to the wire, where 
avalanche multiplication takes place in the high field close to the wire. The 
ions generated in the multiplication process then drift back from the wire 
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towards the cathode. Because of the large drift time it is this movement of 
the ions that generates the largest part of the charge signal for typical wire 
chamber geometries. The electrostatics and signal generation in a cylindri- 
cal wire chamber are studied in exercise 3. 

A simple implementation of a wire chamber for position detection 


over a larger area is the Multi- Wire Proportional Chamber (MWPC), George Charpak received the 1992 _ 
i . : i Nobel Prize in Physics for his invention 
which is a row or an array of cells with one anode sense wire each. and development of the MWPC. 
0.15 py Figure 7.5: Field and equipo- 
T tential lines in a cross-section 
a through a planar MWPC (mod- 
= 0.05 ified from [505]). The wires are 
5 orthogonal to the picture plane. 
‘2 0.00 For an analytical treatment of the fields 
& in MWPCs see [234]. 
> -0.05 


-0.3 -0.2 -0.1 0.0 0.1 0.2 0.3 
x-axis [cm] 


The spatial information in a MWPC is given by the binary information 
of the cell hit or not hit. Thus the position resolution of such a detector 
is d/ V12, where d is the cell size. In practical detectors, the cell size is 
limited due to the effect of mechanical tolerances on the gas gain and due 
to the mechanical instability of the wires at high voltage. 

In a MWPC geometry the anode wires are in an unstable mechanical 
equilibrium. Once one of the wires moves towards the cathodes, electro- 
static repulsion will push the next wire to the opposite side and so on. This 
movement will lead to variations of the field and hence the gain, and in 
the worst case, to discharges that will render the chamber inoperable. This Figure 7.6: Wire displacement due to 
displacement is resisted by the wire tension. ils aa 

It can be shown that the position of the wires is maintained as longas See exercise 1. 
the tension of the wires is larger than 


a 1 /CVL\? 

crit = Ane ( R ) , 

where C is the capacitance, Vo the potential of the wire, L the length of the 
wire, and s the spacing between the anode wires [475]. 

The maximum tension that can be applied to a wire is limited by the 
properties of the wire, depending on the material and the radius, and 
thus, for a given length of wire a minimum spacing between wires must 
be maintained to prevent the displacement of the wires. Practically, wire 
spacings down to a very few mm are feasible, and thus the achievable po- 
sition resolution of MWPCs is in the order of 1 mm. 

To calculate the signal on the wire in a MWPC we can use the Ramo- 
Shockley theorem, eq.(3.1). Due to the proximity of the avalanche to the 
wire, the induced current due to the electron movement is very short, and 
contributes little to the induced charge on the electrodes. It is the motion 
of the ions that provides the bulk of the charge signal, but because of the 
long drift time of the ions the duration of this signal is usually truncated 
by an RC circuit. See exercise 3. 
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To simplify the calculation of the signal we describe all electrodes as 
parallel wires or the superposition of these, which allows for a straight- 
forward two-dimensional approach. In that case, the linear charge den- 
sity on each wire can be found from the voltages of all wires V, from 
An = YmCnmVm, where Cnm is the symmetric capacitance density matrix. 
The electric field close to wire n can then be found from the voltages 


E (r, ja An Lan CnmVm 
nS nI Oe n 2nEorn ` 


The position of a drifting ion as a function of time in the vicinity of the 
wire, where the electric field is to good approximation cylindrically sym- 
metric, is given by 


2 
t ; a, TE 
r(t)=an/1+— with to = =, 
(t) ý to Uìn 


where apn is the radius of the wire. To find the induced current on the 
wire on which the avalanche took place we need the weighting field 

E? (ra) = Cnn/(2€orn), and the induced current due to the movement of 
ions with a total charge Ne is 


dr, (t) 


s Necnn 
In (t) = —NeE} (Tn (t)) dt = 4T E(t +to) f 


Similarly, for a different wire, the weighting field is E” (rn) = Cnm/ (27T £0rn), 


and the induced current is 


, dr,(t) Necn 
In(t) = -NeEw(rn(t)) E = at T 


The Cy», have the opposite sign to Cnn, and thus the induced signals on the 
wires without the avalanche have the opposite polarity as on the wire with 
the avalanche, as the ions move away from the wire with the avalanche 
and towards the other wires in the system. 

The nature of the ions drifting back from the avalanche changes over 
time due to charge exchange reactions and due to ion clustering [298]. In 
the first kind of process ions from molecules with a high ionisation poten- 
tial transfer their charge to other components in the gas that have a lower 
ionisation potential (for example Ar* + CO2 —> Ar + Coy ). In the second 
type of process several molecules and ions form a charged molecular clus- 
ter (for example Ar* + Ar — Art - Ar or COŻ +nCOz > COF - (COd),,). 
The charge transfer occurs on a time scale of 1 ns, whereas the cluster- 
ing happens within 10 ns. Both timescales are short compared to the usual 
drift time of the ions towards the cathode, so that most of the signal in a 
wire chamber is given by the drift of these ionisation clusters, and it is the 
drift velocity of these clusters that determines the signal induced. 


7.4 MICRO-PATTERN GAS DETECTORS 


One way to deal with the dimensional effects and the electrostatic insta- 
bility issues of thin wires in MWPCs is to mount the conductors on an 
insulating substrate. Advances in micro-fabrication led to the development 
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The capacitance density is the ca- 
pacitance per unit length. Cnm is the 
capacitance density between wires n 
and m. 


See also exercise 3. 


There is also usually an additional 
capacitive cross-talk between the elec- 
trodes in the system. 


To maintain energy/momentum con- 
servation in these processes a third 
spectator molecule must be part of the 
reaction. 


As discussed in section 4.1, the drift 
velocity is smaller for particles with 
higher mass. 
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Time [s] 


of a class of gaseous detectors that comprises microelectronic structures 
with sub-millimetre distances between anode and cathode electrodes. Such 
detectors are called ‘Micro-pattern gaseous detectors’ (MPGDs). 

The first representative of this class was the ‘Micro-Strip Gas Cham- 
ber’ (MSGC) [390], in which the electrodes are metal traces deposited 
on an insulating substrate like glass. However, this type of detector suf- 
fers from various instabilities ranging from time-dependent gain shifts 
to discharges between the electrodes, and charges generated during the 
avalanche multiplication that stick to the insulating substrate. Some of 
these charging effects can be remedied by using a substrate with con- 
trolled high resistance [159]. However, the problem of discharges, induced 
by extremely large primary ionisation events due to rare nuclear reactions 
or slow moving heavy ions associated to high charged particle fluxes, was 
never really resolved. These discharge events can have catastrophic conse- 
quences for the delicate conductive structures. 

A whole range of other MPGDs have been proposed, which in ad- 
dition to the reduction of dielectric material in the detector to alleviate 
charging issues also often use the capabilities of micro-fabrication to pro- 
vide two-dimensional spatial information. A number of those still suffer 
from discharge and uniformity issues, and few have been developed be- 
yond small bench prototypes. However, two amplification structures based 
on foils have emerged from these studies, which are now widely used. 


MICROMEGAS 


Micromegas (Micro-mesh gaseous structures) [262] consist of perforated 
metal foils, which are placed close (50—100 um) to an anode plane, with a 
uniform high field in the gap (typically several 10 kV/cm). 

Electrons from a drift volume enter the amplification gap through 
the holes in the micromesh, where a gain of up to 10° can be achieved. 
To maintain the correct distance of the foil to the anode plane spacers 
are located between them. Originally, the spacers were placed on the 
anode plane by lithography of a photoresistive film and the mesh was 
stretched and glued on a frame and then placed on top of the spacers. 
In an optimised production scheme the mesh is laminated on the pho- 
toresistive film, before the material between the spacers is etched away 
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Figure 7.7: Computed evolution of the 
fraction of ions and clusters in Ar/COz 
mixtures (initial Ar* fraction is 90%). 
The different lines are for different 
gas pressures (solid: 1 bar, dashed: 
0.75 bar, dotted: 0.5 bar) [298]. 


Figure 7.8: Field lines and equipo- 
tentials in an MSGC computed for a 
back-plane voltage close to the cathode 
voltage [454]. 


Figure 7.9: Discharge damage in a 
MSGC [272]. 


For an overview see [441]. 
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| Figure 7.10: Schematics of a Mi- 
i 1 HV1 cromegas detector [505]. 
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(‘bulk micromegas’ [263]). The thin gap and the high field result in a very 
fast signal from the ion drift (about 100 ns). The uniform field leads to 
uniform gain, and thus to good energy resolution for example for soft X- 
rays. 

Again, the most demanding situation is the operation in a high-rate 
environment of charged particles, which creates discharges. Due to the ab- 
sence of any fragile conductive structures damage to the micromesh is not 
significant enough to prevent continued operation of the device. However, 
the increase in current due to the discharge can exceed the limits of the 
HV supply, which requires a typical dead time of a few ms to recover. 

Because of this, resistive micromegas detectors have been devel- 
oped [40]. This design does not eliminate the chances of sparking but in 
the event of a spark being formed, it will strongly limit the current and 
hence avoid damage and reduce the droop of the high voltage. 


Mesh support pillar Resistive Strip Figure 7.11: Schematic of a resistive 
0.5-5 MO/cm Micromegas detector [40]. 
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Insulator Copper Strip 
0.15 mm x 100 mm 


The induced currents on the readout electrodes can be read out with 
low noise amplifiers. Alternatively, a very powerful detector with high 
spatial resolution in orthogonal directions can be built using such ampli- 
fication structures on top of a highly segmented silicon pixel readout chip 
(see below). 


GAS ELECTRON MULTIPLIER 


Gas Electron Multipliers (GEMs) [440, 442] are based on polyimide foils, Polyimides (PIs) are high-temperature 


: é r : : : engineering polymers. There are differ- 
clad with copper on both faces. First, a tight grid of holes is chemically ent flavours of polyimide with varying 
etched in the copper. Then a different chemical is used to etch the poly- physical and chemical properties avail- 
imide th h. The thick fth lyimid d the di f th able from different manufacturers. A 
imide through. The thickness of the polyimide and the diameter of the widely used polyimide flm material is 


holes are (100 um). Kapton™. EO 
For operation, a voltage is applied between the two faces of the foil, a PEE A 

which generates a high electric field that is sufficient to induce impact 

ionisation when electrons pass the hole. A potential difference across the 

polyimide of a few hundred Volts will generate a sufficiently high local 

electric field to achieve charge amplification. To ensure high gains, the 
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optimum hole diameter should be of a similar size as the thickness of the 
foil. A fraction of the charge is lost because of attachment to the dielectric, 
and thus there is an effective gain that is smaller than the actual avalanche 
gain. A gas gain of 10° can be achieved with small GEM foils, but more 
typically effective gains are a very few 100 with one foil. Higher gains can 
be achieved by stacking several foils. The advantage of this approach is 
that a large overall gain can be achieved with lower voltages across the 
individual layers. The charge created in one hole in one layer spreads to 
several holes in the next, and thus the Raether limit (see section 4.5) can 
be avoided and the chance of electrical breakdown reduced. 
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Figure 7.12: Left: Electron microscope 
picture of a section of typical GEM 
electrode, 50 um thick. The hole pitch 
and diameter are 140 and 70 um, 
respectively. Right: Field lines in a 
GEM [442]. 


Due to the high attainable gain GEMs 
can be used with a CsI coating to ef- 
ficiently detect the single electrons 
released by ultra-violet photons for 
example in Cherenkov detectors. 
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Figure 7.13: Triple-GEM. Left: Schematics. Right: Effective gain (full line) and discharge rates (dashed line) as a function of voltage [442]. 


Ultimately, the amplified charge is registered on a grid of readout elec- 
trodes on an insulating layer underneath the GEM foils. By using similar 
techniques as in the fabrication of the GEM foils, orthogonal strips can 
be created on the two sides of a copper-clad polyimide foil, which is then 
glued onto an insulating carrier. After chemical removal of the polyimide 
layer covering the lower strips, a grid of orthogonal readout strips is cre- 
ated that allows 2D reconstruction of the location of the hit [164]. 


PLANAR MPGDS WITH PIXELATED READOUT 


Some of the most powerful gaseous detectors have been created by com- 
bining planar MGPDs, like Micromegas or GEMs, with high channel 
density readout ASICs that have been originally developed for the read- 
out of silicon pixel detectors (a widely used readout chip is the Timepix 
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chip [343]). The use of industrial post-processing techniques to build a 
Micromegas structure onto this readout ASIC led to the development of 
InGrid (integrated grid) structures [189]. Gaseous detectors with Ingrid 
structures have been called GridPix. A very low material GridPix detector 
with minimal material budget (gas gap 1 mm and thinned ASIC) for use in 
low-mass vertexing detectors is Gossip [179]. GridPix detectors are cur- 
rently under investigation for use as active elements in future TPCs (see 
page 121). Due to the highly segmented readout the spatial resolution in 
these detectors is only limited by diffusion. 

GEMs have also been integrated with Timepix chips, with the added 
benefit of increased mechanical robustness (GEMGrid) [147]. 


7.5 DRIFT CHAMBERS 


A different approach to improve the position resolution of gaseous detec- 
tors is to exploit the finite speed, with which the electrons move through 
the gas. By measuring the time it takes the primary electrons to move from 
the point of the ionisation to the point where the actual electric signal is 
generated by the massive increase of charge in the avalanche, one can de- 
duce the drift distance, if the drift velocity is known. 

The start time for the drift time measurement can be provided by a fast 
external timing detector, or in experiments at accelerators from the known 
time structure of the beam. In both cases the time of flight and electric sig- 
nal propagation speed in cables and active electronics, which are likely 
to vary between different channels, must be taken into account and cali- 
brated. The start time can also be reconstructed from the data of the detec- 
tor itself, if several cells of the drift chamber are hit. 
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Figure 7.14: Pixelated MPGD readout. 
SEM image of an InGrid with partly 
removed Grid. The height of the pillars 
is 50 um [348]. 


Figure 7.15: SEM picture of GEMgrid 
detector. The amplifying structure 

is 55 um of SU-8 (a commonly used 
epoxy-based negative photoresist) with 
30 um diameter holes. The top metal 
layer is aluminium [147]. 


Such a detector is called a ‘drift cham- 
ber’. 


This is often called the to. 


In the simplest case the external timing 
signal could be from a plastic scintilla- 
tor with a PMT (see section 5.1). 


In experiments at accelerators this 
can also be used to identify the bunch- 
crossing from which the measured 
particle originated. 
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DRIFT CHAMBER CALIBRATION 


For the reconstruction of the position of the incoming particle, the correla- 
tion of drift-time and drift-distance must be known. This relation depends 
on the local electric field, the drift velocity at that value of the field, and 
includes the increase of the drift-distance due to deflection in a magnetic 
field (if present) by the Lorentz angle. Particularly in magnetic fields, the 
drift-time/drift-distance relation can be asymmetric on the two sides of the 
wire. Generally, the relation will be non-linear, and is in many cases vary- 
ing with the location of the drift cell within the experiment and/or time. 

While it is in principle possible to calculate the drift properties (see 
section 4.1), the quality of the results will be affected by the accuracy 
of the assumptions on the gas and field parameters made, small changes 
of which will have significant effects. Calibration of the drift-time/drift- 
distance relation using prototypes and reference tracking systems can also 
be compromised by the strong variations of the drift properties between an 
idealised set of operating parameters during the calibration and the actual 
operation of the detector. 

The most accurate calibration is thus achieved using the data from the 
detector itself, taken during the actual operation. In ‘autocalibration’, the 
drift-time/drift-distance relation is found in an iterative process, where in 
a series of iterations an improved version of the relation is obtained using 
the position information reconstructed using the relation from the previ- 
ous iteration. At the same time this method can also provide a calibration 
of the timing offset for each channel (the fo). This method is capable of 
tracking slow changes of the operating parameters, and thus is the most 
accurate tool to calibrate a drift chamber response (see for example [77]). 

The need for a stable drift-time/drift-distance relation puts an addi- 
tional requirement on the drift chamber gas: the drift velocity should not 
depend strongly on the electric field, so that local differences of the field 
or variations in time do not change the drift-time/drift-distance relation 
significantly. A drift velocity that satisfies this requirement is called ‘sat- 
urated’. It can be achieved for ‘hot’ drift gases for sufficiently high fields, 
but ‘cold’ gases typically display significant variations of the drift velocity 
with the electric field strength. 


DRIFT-TIME MEASUREMENT AND POSITION RESOLUTION IN 
DRIFT CHAMBERS 


As discussed in section 3.8 the drift-time of the ionisation electrons to 
the wire can be measured by time-to-digital converters (TDCs). The in- 
duced signal is a superposition from the signals generated by the move- 
ments of the ions created in the avalanches from several primary ionisation 
electrons. The time structure of this signal will reflect the arrival times of 
these electrons at the anode wire, where they start their avalanches. Due 
to the long duration of the signal produced by the drifting ions in a wire 
chamber, the signal from the individual primary electrons will pile up on 
top of each other with the difference in arrival times of the individual pri- 
mary electrons being small compared to the overall signal length. Hence, 
by selecting a threshold for the comparator firing on the signal induced 
on the wire, one effectively sets a threshold on the number of primary 
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This is often called the r-t or the x-t 
relation. 


In high rate/high gas gain applications, 
space charge from the slow ions created 
in the avalanches can also modify the 
electric field and thus the drift velocity. 
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Figure 7.16: Digitised drift times versus 
distance of the muon track from the 
wire for the ATLAS muon chambers in 
a testbeam prototype (without magnetic 


field) [427]. 


For the definition of hot and cold gases 
see section 4.1. 
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electrons that have to arrive at the wire to stop the measurement of the 
drift-time. This threshold, in units of number of primary electrons, is given 
by thrprim = thre /g, where g is the combination of avalanche gas gain and 
(pre-)amplifier, and thr.; the comparator electronics threshold (usually a 
voltage). 

The position resolution in drift chambers as a function of the drift 
distance shows a distinctive pattern. At short drift distances the resolu- 
tion is driven by the finite spacing between the primary ionisation clus- 
ters in the gas. It will be smaller for gases with a high ionisation density 
along the track. For larger drift distances the resolution becomes diffusion- 
dominated, and increases basically with the square root of the distance. 

The resolution improves for increased gas pressures, as both the spac- 
ing between ionisation clusters and the diffusion in the gas are reduced. 
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By the choice of threshold the system can be optimised for improved 
resolution in either the region dominated by the primary ionisation statis- 
tics, or the region dominated by diffusion. A higher effective threshold (in 
number of primary electrons) improves the resolution at large distance, as 
a later electron will be closer to the centre of the diffusion distribution. A 
lower effective threshold (in primary electrons) will improve the resolu- 
tion close to the wire, as the earlier electrons will be produced closer to 
the point of closest approach of the track. 
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Drift distance here is defined as the 
distance from the wire to the closest 
point on the track of the incoming 
charged particle. 


These effects are also studied in exer- 
cise 4. 


The price to pay is, of course, increased 
multiple scattering. 


Figure 7.17: Typical drift distance 
dependence of the position resolution in 
a drift chamber [439]. 


Figure 7.18: Effect of primary statistics 
on the drift distance measurement in 

a cylindrical wire chamber. Crosses: 
discrete primary ionisation clusters. 
Dashed lines: ideal distance. Full lines: 
actual distance to the first ionisation 
cluster. 


Figure 7.19: Measurements (dots) and 
simulation (lines) of the resolution in 
prototypes of the ATLAS MDT (cathode: 
aluminium tubes with 30 mm diameter, 
anode: wire diameter 50 um) for 
different effective thresholds [422]. 
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DRIFT CHAMBER GEOMETRIES 


Typical drift chamber geometries are either individual tubular or square 
sections for integration into planar structures, or larger systems which are 
designed to fill the central cylindrical tracking cavity in a collider detector. 
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Figure 7.20: Drift chamber geometries. a) Planar made of tubes. b) Planar made of square cells. c) Jet chamber (endview). d) Time Projection 
Chamber (side view). Conceptual, not to scale. 


A historically important drift chamber geometry for central tracking is 
the ‘jet chamber’, where radial layers of anode wires provide a large num- 
ber of points (about 1 hit per cm in r) along tracks from the vertex. These 
type of detectors had good two-track resolution (~100 um), good pattern 
recognition capabilities, and measurement of dE/dx. To compensate for 
the Lorentz angle the cells were often tilted. 
Figure 7.21: Example for a jet cell 


R137.998 o geometry: CDF Central Outer 
Tracker [25]. 
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A gaseous drift chamber technology that provides 3D information over 
a large volume with minimum material is the Time Projection Chamber 
(TPC). It is a cylindrical gas-filled volume split into two halves by a cen- 
tral cathode disk at very high voltage (100 kV in the ALICE TPC). Pri- 
mary electrons drift to the endplates, which are instrumented with a two- 
dimensional readout. In the case of existing and past TPCs this uses a grid 
of anode wires for avalanche multiplication. To prevent ions drifting back 


122 


into the detector volume a gating grid is used, which closes their path elec- 
trostatically after an avalanche on the anode wires. 

The position in the r-@ plane is determined from the ion signal in- 
duced on a plane of pads. The z information is obtained from the drift time 
of the electrons to the readout plane. The r@ information in this detector 
is maintained despite the very long drift distance because the transverse 
diffusion is reduced due to the parallel E and B fields (see section 4.1). 
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Because of the long drift-distance, calibration of the drift-time/drift- 
distance relation is particularly demanding. The long drift time makes this 
detector impractical for high luminosity hadron colliders, but they are used 
for heavy ion experiments (for example ALICE at the LHC) and are under 
consideration for future lepton colliders, where collision rates are lower. 
For future colliders, detector planes with better position resolution, for 
example based on MPGDs are being investigated [444]. One benefit of 
MPGDs like Micromegas or GEMs for the use in a TPC, in addition to 
their high position accuracy, is that their design limits ion feedback due to 
their amplification geometry. 


TWO-DIMENSIONAL POSITION DETECTION 


Typically, wire chambers with or without drift-time measurement most 
accurately record the position of the incoming track perpendicularly to the 
wire. However, sometimes position information along the wire is required, 
albeit usually with a much lower accuracy. Several approaches have been 
pursued for this. 


Charge sharing 


If a readout amplifier is connected to both ends of the wire, the charge 
signal induced on the wire at the location of the avalanche will be divided 
according to the ratio of resistances towards the two ends, independently 
of the overall length of the wire, and the capacitance or the inductance and 
their distribution. Position resolutions of less than 1% of the length of the 
wire are achievable [415]. 


Delay line 


Along the wire the signal propagates with a finite speed, close to the speed 
of light (~30 ps/cm). Again, both ends of the wire need to be instru- 
mented, but this time the position along the length of the wire is found 
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Figure 7.22: Schematics of a TPC with 
wire readout. 


Figure 7.23: ALICE TPC (modified from 
[49]). 


Typical wire resistances are a few kQ/n. 
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from the difference in arrival time at the two ends. Position resolutions of 


1-2 cm, independent of the length of the wire, can be achieved in this way. 


It can be shown that the achievable position resolution for charge shar- 
ing and delay line readouts is comparable for a wire length that is about 
250 times the gap between the cathodes surrounding the wire. Above this 
length the delay line measurement will yield the better result [153]. 


Stereo angle 


Stereo angle is a method for measuring the second coordinate that can be 
employed if more than one detector layer is available. If the different lay- 
ers are arranged so that the wires in one layer are tilted by a small angle 
(the ‘stereo angle’) with respect to the wires in the next, then the position 
along the wire can be found from the intersection of the positions recon- 
structed in the accurate direction in the two layers. The precision of the 
second dimension measurement can be improved by increasing the stereo 
angle, at the cost of a degradation of the combined position resolution in 
the accurate direction. 

At high rates this method can lead to confusion, which creates spuri- 
ous hits (‘ghosts’) in the reconstruction. 


Segmented cathode pads 


The most accurate technique to establish the second coordinate of an 
avalanche in a wire chamber is by measuring the position of the induced 
charge on segmented cathodes. 

As we have seen, the dominant contribution to the induced charge in 
a wire chamber is coming from the drift of the ions. As well as on the an- 
ode, a charge in the chamber volume induces also a charge distribution on 
the cathode. As the ions drift towards it, the distribution on the cathode 
becomes narrower and narrower, until all the induced charge on the plane 
collects on the impact point of the drifting ion. Conversely, at a point on 
the cathode away from the impact point the charge density will first in- 
crease, and then decrease as the ion approaches the cathode. If the cathode 
is segmented into strips or pads the charge induced on these segments will 
be the integral of the charge density over their area and the overall be- 
haviour will be similar. 

For practical applications the duration of the signal will again be lim- 
ited by an appropriate high-pass RC circuit, which also limits the band- 
width for integration of random noise. For the cathode readout the signal 
shaping has the effect of capturing only the initial increase of the induced 
charge, so that typically more than one strip/pad shows a signal, and the 
centroid of the recorded signals gives a measurement of the position of the 
drifting ion, projected onto the cathode. 

The weighting field for the signal induced on a cathode strip with in- 
finitesimal width 2da by the movement of the ions in a MWPC with the 
anode plane centred between two cathodes spaced apart by 2D can be pa- 
rameterised by the semi-empirical expression [259] 


d(rE,(r)) _ ,, 1—tanh?(K2A) 
da/D "| 4-K3tanh?(KA)’ 


(7.1) 
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This approach is also used in silicon 
microstrip detectors. 
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Figure 7.24: Stereo angle: hit wires 
are indicated by arrows, reconstructed 
position by ellipsis. 
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Figure 7.25: Confusion. Two actual hits 
(circles) and two ghosts (squares). 


For a discussion of a simplified charge 
distribution see exercise 5. 


124 


with A = x/D, and Kj, K2 and K; parameters that are found from the best 
fit to the data. The induced charge on strips of finite width can then be 
found from integration. 
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Because of the symmetry of the induced charge distribution in the 
cathode strips/pads the position of its centre can be obtained from the 
centre-of-gravity of the measured signals in these segments, 


x 2s La Ann 
centre — Ss 
Endr 


where the sum goes over all the strips, x, are the positions of the strip cen- 
tres perpendicularly to their length, and qn is the recorded charge on each 
strip. 

In practice, only strips with a signal above a threshold given by the 
noise are considered in the sum, as the sum of many noisy pads would 
otherwise make the measurement meaningless. A common approach is to 
modify eq. (7.2), 


(7.2) 


Sn Ln Xn (dn F B) 
centte = “7. Dpi? 
Èn (dn — B) 


with a bias level B = bY. qn, with 5 x 107° < b < 25 x 107°, and only the 
elements are considered in the sum for which g, — B > 0. However, the 
suppression of elements in the sum introduces a bias pattern of the mea- 
sured position that is periodic with the strip width, and zero at the edges of 
the strip and the centre [408] (This is sometimes called an ‘s-shape’ pat- 
tern.). The non-linearity can be improved by reducing the strip width. One 
way to achieve this without increasing the number of readout channels 

is by dividing the strips, and connecting only every second or third strip, 
with the intermediate strips floating, but tightly coupled capacitively to the 
readout strip [461]. 


(7.3) 


Cathode 


Anode wire 
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Figure 7.28: Floating cathode strips. 
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Figure 7.26: Cathode weighting 

field as a function of position 

for D/R = 10000/15 (modified 

from [259]). R is the anode wire radius 
and s the anode wire spacing. 


In a detector with pad segmentation a 
similar expression would be used for the 
second, orthogonal direction. 


This algorithm would fail in high track 
density environments. If separation of 
close tracks is needed, a more elaborate 
reconstruction is required, which com- 
pares the measured charge distribution 
with a standard charge distribution (for 
example using the parameterisation of 
eq. (7.1)). In this way a two-track sepa- 
ration of a few tenths of the strip width 
can be achieved [267]. 


Position [mm] 


-0.4 -0.2 0 


0.2 04 

Residual [mm] 
Figure 7.27: Reconstructed track 
position from cathode versus residual 
(measured track position minus refer- 
ence position). The position has been 
determined using eq. (7.3) as described 
in the text with b = 0.025 (modified 
from [408]). 
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To obtain sensitivity to the avalanche position the sum in eq. (7.3) 
must include more than one element, and thus the strip width should be 
chosen so that a signal above the noise threshold is always obtained for 
at least two strips. A good rule of thumb is that the optimal strip width is 
about twice the distance from the wire to the cathode [259]. 

Position resolutions of a few 10 um can be achieved in the direction of 
the wire using segmented cathode readout [188]. 


CATHODE STRIP CHAMBERS 


Cathode strip chambers (CSCs) are large-area planar MWPCs that rely on 
the readout of the cathode pads to achieve good spatial accuracy. The gap 
is of the order of 5 to 10 mm, to allow for operation at high rates. Thinner 
gaps result in short electron drift times (up to a few 10 ns), and thus mod- 
erately good time resolutions. An advantage of the cathode strip readout 
is that the charge interpolation is a relative measurement of the signal in 
adjacent pads. Hence the chamber performance is not degraded by (mod- 
est) variations in environmental (temperature, pressure, magnetic field 
etc.) and operating (gas gain) conditions. However, calibration of the re- 
sponse of individual strips is required. This can conveniently be achieved 
by pulsing the anode wires, which will create a defined cathode charge by 
capacitive coupling. 


THIN GAP CHAMBERS 


A similar detector geometry, if very good time resolution is required, is 
the thin gap chamber (TGC), where the spacing between the anode and the 
cathode plane is reduced to order mm, thus reducing the maximum drift 
time in the gap. Because of the small scale, electrode non-uniformities 
become relevant. A uniform cathode surface is achieved by a layer of 
graphite, which has a sufficiently high electrical conductivity to allow 

for the removal of the induced charges after the drift, but is sufficiently 
transparent to allow for the induction of a signal on segmented readout 
electrodes on the backside of the cathode. 
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Figure 7.29: Schematics of the ATLAS 
muon TGCs [66]. 
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7.6 PHOTON DETECTION WITH GASEOUS DETECTORS 


The typical application of gaseous detectors is tracking of charged par- 
ticles, where a sufficiently large signal size and hence efficiency can be 
achieved from gas amplification for detectors of moderate thickness (down 
to a few mm). However, there are applications where we want to observe 
and locate (single) photons. Two energy regimes are of primary interest: 
In Cherenkov detectors photons in the visible or UV range need to be de- 
tected, whereas for the detection of transition radiation it is X-ray photons 
for which the detector needs to be sensitive. In both cases the challenge 
for the use of gaseous detectors is the low photon absorption cross-section 
in these energy ranges (see section 2.1). While the energy transfer from 
the photon, in particular for X-rays, can be substantial, the small interac- 
tion probability due to the low density material in a gaseous detector limits 
the efficiency for photon detection. Typically, the detection efficiency for 
photons in a regular gaseous detector is of the order 10~?. 


GASEOUS DETECTORS FOR DETECTING CHERENKOV RADIATION 


The challenge for Cherenkov detectors with gaseous photon detection is 
to achieve sensitivity of the gas, but transparency of the container (and 
the radiator). It requires the combination of a gaseous detector that has a 
low photo-ionisation threshold and photon entrance windows with a high 
transparency cut-off energy. 

The figure of merit for the photon detection is the quantum efficiency 
(QE), which is defined as the fraction of photons creating ionisation, as 
a function of the photon energy. There are two approaches: The first is to 
add a chemical with low photo-ionisation threshold to the gas. Two com- 
pounds are commonly used: Triethylamine (TEA), with a threshold energy 
of about 7.5 eV, and tetrakis (dimethylamine) ethylene (TMAE), with a 
threshold energy of about 5.3 eV. With these thresholds, TEA requires 
windows made of CaF, (transparent up to about 9.5 eV), while for TMAE 
quartz windows provide a sufficiently large energy gap (transparent up to 
about 7.5 eV). 
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One challenge with these additives in a gaseous detector is the in- 
creased potential for photon feedback, requiring heavily quenched gas 
mixtures. Typical quencher additives used are therefore pure CH4 or 
CH4/C2H6 75/25. TEA and TMAE are liquid at room temperature, and 
they are usually added to the detector gas by bubbling the main chamber 
gas through the liquid. The drawback of TMAE is its low vapour pressure 
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The energy range above 6.2 eV is called 
the ‘vacuum ultraviolet’ (VUV) region, 
as photons with such energies are 
strongly absorbed in air, due to its 
oxygen content. 


Figure 7.30: Quantum efficiency 
(dashed lines) and transmission (full 
lines) for common detectors and 
transparent materials in Cherenkov 
detectors (data from [386, 479]). The 
exact data may vary due to changes 
in the environmental parameters and 
purity of the gases. 


Another measure to mitigate photon 
feedback is to keep the gas gain in such 
detectors low. MPGDs like GEMs can 
be useful, as their geometry can prevent 
the propagation of photons [482]. 
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at room temperature (0.67 mbar, compared to 96 mbar for TEA), which 
requires heating and tight temperature control for the bubbler. 

The alternative approach to detect UV photons in gaseous detectors 
is to use a cathode coating with a low work function [373]. The material 
with the highest quantum efficiency for such purpose is CsI, because of 
its low work function, but also because it has a long electron absorption 
length (about 16 nm) [163]. Thin layers can be vacuum deposited either as 
a semi-transparent thin (about 100 um) layer on the inside of the entrance 
window or on a metallic surface on the reverse cathode of the chamber. 
The quantum efficiency can be enhanced by heating the coating under 
vacuum after deposition. Contamination with oxygen or water must be 
avoided. 

The quantum efficiency for CsI if used in gaseous detectors can be 
smaller than for emission into vacuum due to back-scattering of the photo- 
electrons into the electrode in elastic collisions on the molecules in the 
gas. Hydrocarbon or fluorocarbon gases have a higher probability for in- 
elastic collision due to rotational and vibrational levels and thus the back- 
scattering effect is strongly suppressed, with a photo-electron collection 
efficiency of about 90% of the vacuum value for CH4 or CH4/i-C4Hj9 
mixtures [224]. In addition, for efficient extraction, the field at the loca- 
tion of the coating should be high. The quantum efficiency also can be 
increased by the adsorption of TMAE in the coating. 


GASEOUS DETECTORS FOR DETECTING TRANSITION RADIATION 


In principle, gaseous detectors are attractive for the active component of a 
transition radiation detector, as they provide an affordable solution for the 
large detection area usually needed in these detectors. For the detection of 
the high energies from transition radiation (typically a few keV), the strat- 
egy is to use a gas with high-Z atoms, as the absorption length for pho- 
toelectric effect scales with Z” with m between 3 and 5 (see section 2.1). 
For xenon, for example, the absorption length in this energy range is about 
10 mm. As usual, an appropriate quencher needs to be added for the use in 
an MWPC. CHg and CO3 are widely used quenchers. It should be noted 
that due to the small emission angle for transition radiation photons (see 
section 2.9) the location of the energy deposition from TR photons will 
usually be indistinguishable from the charged energy loss along the track 
of the primary particle. 


7.7 AGEING OF GASEOUS DETECTORS 


In principle, the exchangeability of the gas in the detector would make 
radiation damage to the medium not a concern. However, after responding 
to a large number of incoming particles, polymers formed from the or- 
ganic molecules in the gas and contaminants will begin to cover the elec- 
trodes in the detector as solid or liquid deposits, leading to performance 
degradation that ultimately will make the detector inoperable [296]. On 
the anode wires the deposits can be conductive (typically carbon) or a non- 
conductive polymer. In both cases the increase in radius lowers the field 
around the wire, whereas in the latter case the insulating layer will also 
trap electrons, further reducing the field. The net result is progressive gain 
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For further discussion of actual tran- 
sition radiation detectors see sec- 
tion 12.5. 


For a comprehensive summary of ageing 
in gaseous detectors see [474]. 
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loss, which makes operation of the chamber more and more difficult, as 
the deposits are generally not uniform. 


On the cathode, the deposits lead to the Malter effect, where an insu- 
lating layer on the cathode traps ions on the surface, creating an electric 
field through the layer. The field can become strong enough to rip elec- 
trons out of the cathode, and accelerate them towards the positive charge 
layer. There not all of the electrons are captured, but some enter the gas 
volume, where they will be the source for another avalanche at the anode 
wire. The Malter effect thus leads to self-sustained dark currents, which 
decay only slowly when the high voltage is removed from the chamber. 

The constituents for the polymerisation are free radicals dominantly 
formed from the organic molecules in the gas (in particular the quencher), 
but also contaminants. These radicals are created in abundance in the 
avalanche process. The radicals will then link to polymer structures that 
are often branched, making them particularly difficult to remove. 

A commonly used scale parameter to describe the effects of ageing 
in gaseous detectors is the charge accumulated over the lifetime per unit 
length of wire in a wire chamber, or per unit area for gaseous detectors 
with planar electrodes or MPGDs. The accumulated charge is given by the 
product of the rate of incoming particles, the primary ionisation per in- 
coming particle, and the gas gain. Ageing effects are usually of a concern 
for wire chambers that accumulate 1 C/cm and above, although ageing 
effects can develop at significantly lower levels if care is not taken. 

The chemistry of the polymerisation is not well understood, but some 
general trends are known, and lists of materials to avoid or to be used in 
contact with the gas have been compiled [182]. Of particular concern are 
silicon, which enters gaseous detectors as contaminant from silicon oils 
or grease or mould release agents, and phthalates, which are used as soft- 
eners in plastics. Both have a tendency to make their way into polymers 
accumulating on the anode wire. 

In general, radiation damage effects in gaseous detectors increase for 
lower gas flow, indicating that the flow helps removing active radicals, and 
the polymerisation is not constrained by the supply of a gas component 
from the input gas. 

Some additives are known to reduce ageing, and even to sometimes re- 
verse ageing effects. In particular, these are chemicals containing oxygen, 
which have a lower tendency to form polymer chains. Examples for such 
compounds are alcohols or methylal, but also water, which in addition is 
also believed to be beneficial in reducing Malter effect by increasing the 
conductivity of the insulating layer on the cathode, so that surface charges 
can discharge. 
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Figure 7.31: Examples of Si-deposits 
on anode wires. Left: Wire from the 
CDF central tracking chamber [143]. 
Right: Wire from CMS muon chamber 
prototypes [247]. 


Malter’s original paper is [356], al- 
though the effect is described in an 
earlier paper [275]. 


A free radical is an atom, molecule, 

or ion that has at least one unpaired 
valence electron. Radicals are typically 
highly chemically reactive. 


In typical detector gases it takes more 
than 10 eV to ionise a gas molecule, but 
only a few eV to break a chemical bond 
and create a radical. 


It should be noted that the accumulated 
charge is not sufficient to describe age- 
ing in gaseous detectors fully. Ageing 
does also depend on the rate of charge 
accumulation, with ageing effects more 
severe at lower rates. This is relevant 
when accelerated ageing tests are per- 
formed. 

The relevant field is plasma chemistry. 
However, there studies are usually per- 
formed at significantly lower pressures 
and in AC fields. Nevertheless, attempts 
have been made to carry over conclu- 
sions from there to wire chambers. 


Figure 7.32: Methylal. 
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An additive that can have both beneficial and detrimental effects on 

ageing is CF4. This ambiguity is a manifestation of the “Competitive Abla- 

tion and Polymerisation’ (CAP) principle [509], which states that ablation 

is creating reactive monomers from the neutral monomers entering the 

gas volume (detrimental), but also from the break-up of existing polymers 

(beneficial). These radicals can then be the precursor to long-chain poly- 

merisation, but also to the formation of smaller stable molecules that are 

still volatile and are removed with the exchange of the detector gas. CF4 in 

the presence of oxygenated chemicals will act as an etching agent, and is One other concern with CP4 is that it 
. an aa = can lead to the formation of hydroflu- 

even capable of removing polymers containing silicon from anode wires, oric acid (HF), which is a potent acid 

while CF, in conjunction with hydrogenated substances will tend to poly- #4" particular etches glass. 

merise. No wire chamber has been able to operate at 20 C/cm and above 

without the addition of CF4 to the chamber gas. 


Figure 7.33: Left: Anode wire after an 
accumulated charge of 0.07 C/cm in 
Ar/CF4/CH4 67/30/3 [216]. Right: An- 
ode wire after 20 C/cm in Xe/CF4/CO2z 
70/20/10 [35]. 


Ageing in MPGDs depends on the size of the electrodes. Detectors 
with small electrodes, like MSGCs, will suffer similar ageing effects 
as wire chambers (in addition to discharge damage as discussed in sec- 
tion 7.4) taking into account the smaller electrode size. Similarly as in 
wire chambers, high rate operation requires high purity of the gas and 
careful selection of detector materials. If these conditions are met, accu- 
mulated charges of 0.1 C/cm can be achieved [158]. 

Planar MPGDs like GEMs or Micromegas undergo the same ageing 
mechanisms, but the effects are less severe due to the increased electrode 
surface area. Operation of triple layer GEMs for accumulated charge den- 
sities in excess of 0.8 C/cm? without degradation has been achieved [241]. 

Ageing effects in RPCs have been observed in early applications. 
These could be traced to either significant amounts of water (2000 ppm), 
which together with tetrafuororethane in the chamber gas resulted in 
the formation of hydrofluoric acid, or the flawed application of linseed 
oil, which has been used to improve the surface quality of bakelite elec- 
trodes [481]. If these issues are avoided, RPCs demonstrate good ageing 
performance [261]. 


7.8 SIMULATION OF GASEOUS DETECTORS 


We have seen in the previous chapters that in principle the physics of the 
various aspects of the operation of a gaseous detector is well understood. 
At the same time, the often complex stochastic nature of many of these 
processes makes an accurate analytic prediction of the detector perfor- 
mance impossible. However, Monte Carlo algorithms are capable of cap- 
turing these aspects and, in parallel to the development of gaseous detector 
designs, a suite of powerful programmes have been developed, which cap- 
ture all aspects of the operation of a gaseous detector, from the generation 
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of the primary ionisation charges to the electrical signal created [486]. 
These results allow for a detailed prediction of the detector performance 
for a given set of design and operating parameters, and thus are very valu- 
able in the optimisation of gaseous detectors. 

The central piece of this software is GARFIELD [487]. This pro- 
gramme provides facilities to calculate analytically the electric field in 
simple geometries, generates primary ionisation, drifts the charges in the 
field, including collision effects and diffusion, replicates avalanche multi- 
plication and obtains signals from the drift of the final charges in the sys- 
tem. Some of its outputs are [487] 


e field maps, contour plots and three-dimensional impressions; 

e the wire sag that results from electrostatic and gravitational forces; 
e optimum potential settings to achieve various conditions; 

e plots of electron and ion drift lines; 

e x(t)-relations, drift time tables and arrival time distributions; 


e signals induced by charged particles traversing a chamber, taking 
both electron pulse and ion tail into account. 


GARFIELD invokes inputs from several other pieces of code, some of 
which have originally been conceived as standalone programmes, but do 
have an interface with GARFIELD: 


e HEED: This programme calculates primary ionisation yields in 
gases based on the PAIR model [460] (see section 2.4). 


e MagBoltz: This code [138] calculates drift properties for electrons 
in gas mixtures under the influence of electric and magnetic fields, 
using the Monte Carlo approach described in section 4.1 [137]. 


A python version of MagBoltz, PyBoltz [38], is available and can be 


found at [411]. 


e Garfield allows for the input of field maps from commercial finite 
element analysis (FEA) software, like Ansys, Maxwell, Tosca, 
QuickField and FEMLAB, which can compute approximate fields 
in nearly arbitrary three-dimensional configurations with dielectrics 
and conductors. 


e FEA programs perform reasonably well in the calculation of po- 
tentials, but they are not well suited to the calculation of fields in 
typical gaseous detector geometries, as the quadratic shape func- 
tions used in FEA programs do not describe 1/r or 1/r? fields well, 
and because electric fields can be discontinuous at boundaries. A 
different approach is the Boundary Element Method, where the el- 
ements are on the boundaries, not inside the problem domain and 


charges are computed for the boundary elements. A practical imple- 


mentation that interfaces with GARFIELD is neBEM (nearly exact 
Boundary Element Method) [383, 382]. 
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Contrary to popular belief, the original 
namesake for the programme’s name 
GARFIELD is not a lasagne-devouring 
feline, but the 20" president of the 

USA, J.A.Garfield (1831—1881). In 

1881 President Garfield was shot by an 
erstwhile follower, who felt betrayed 
for not receiving a political office in 
return for his support, and the bullet 
lodged in his body. Graham Bell then 
successfully developed the first metal 
detector to locate the bullet. However, 
despite working properly during the 
development, the detector failed on 
President Garfield. Only later on Bell 
realised that the metal detector was 
confused by the iron bed springs within 
the bed the wounded President was 
lying on [126]. In the end President 
Garfield did not die from the bullet, but 
two months later from infections caused 
by the doctors searching with their 
fingers for the bullet in his body without 
proper sterilisation. The lesson is: For a 
successful technology you need to know 
your physics and apply it correctly in 
the required environment. 


Originally, this programme calculated 
the transport parameters by solving 
the Boltzmann transport equation, 

eq. (4.9) [136], which was the motiva- 
tion for its name. 
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e Finally, the calculated signals from Garfield can be used as input to 
publicly available commercial or free electronic circuit simulation 
software like SPICE and its derivatives. 


All this software is constantly maintained and improved to meet 
the challenges of new detector technologies and design ideas. Origi- 
nally, GARFIELD has been written in FORTRAN, but a C++ version, 
GARFIELD++ [446], is now available. This currently represents the most 
accessible version of the code. 

Some examples for the use of this software for end-to-end simulation 
projects can be found in [422] (ATLAS MDT), [425] (RPCs), or [155] 
(GEMs). 

A different set of detector simulation software is GEANT (GEometry 
ANd Tracking) [260], where the emphasis lies less on the prediction of 
the position measurement performance for single charged particles in a 
gaseous detector, but on the prediction of the physics-relevant response 
(momentum or energy resolution) of all components of a detector system 
in the presence of all the matter in the system. This includes modelling 
of energy loss of charged particles, including radiative energy loss, and 
multiple scattering [29, 45]. 

For a complete simulation of gaseous detector it is possible to connect 
the GEANT framework with GARFIELD++ [261]. 
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GARFIELD++ also can simulate the 
signals generated in semiconductor 
detectors. 
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Key lessons from this chapter 


e The low density of the active material in gaseous detectors limits the primary ionisa- 
tion charge from the passage of a charged particle. To still obtain detectable electrical 
signal internal avalanche amplification is required. 


e A common way to achieve the fields for avalanche multiplication is to use thin wires. 
In a wire chamber the induced signal is dominated by the drift of the ions. Due to 
their limited speed the time to collect all the charge is often prohibitively long, but it 
can be truncated by appropriate electrical filtering, as most of the signal is induced 
early on. 


e To suppress photon feedback resistive cathodes and/or the addition of a quencher gas 
are required. 


e The gases used are mixtures designed to achieve various requirements (high ion- 
isation, high gas gain, no photon-feedback, saturated drift velocity, low diffusion, 
high-rate operation, long lifetime). Several of these requirements cannot be satisfied 
at the same time, thus there is no gas that will work for all applications. Careful opti- 
misation of gas mixture and geometric design for the specific application is required. 


e This optimisation is helped by the availability of a suite of powerful software simula- 
tion tools (Garfield/HEED/MagBoltz). 


Even though gaseous detectors generally have a limited position measurement and 
high-rate performance they are still an effective and affordable way to instrument 
large area detectors like muon detectors. 


e A TPC is a large-volume, low-material tracking detector which provides a large num- 
ber of space points (with moderate precision), and the capability to measure dE /dx. 


The active development of MPGDs is pushing performance parameters in gaseous 
detectors to new limits. 


EXERCISES 
1. Stability of wires ina MWPC. 


Consider displacement of the anode wires in a MWPCs as shown in Figure 7.6. 


a) Show that the force per unit length between two parallel wires at the same potential Vo and 
with a capacitance per unit length C; at a distance r is given by A = (C\Vo)?/(27€0r). 
b) Assume that the wires are displaced alternately by a distance 6 < s perpendicular to the cen- 
tral plane as shown in Figure 7.6, where s is the spacing between wires. Use superposition 
and the symmetry in the problem to show that the force on one wire from all the other wires 
in the system is F = (1(CV)?/(4€0))(5/s7). 
To counter this displacement, the wire is held under tension. In equilibrium, the restoring 
force due to the tension must equal the displacing force caused by the interaction with the 
other wires. Find a solution for the displacement of the wire as a function of the position 
along the length of the wire. Use the boundary conditions to find a valid solution for the wire 
tension. 


c 


wm 


If the wire tension exceeds this value, no solutions are possible, and the displacement 6 must 
remain zero along the wire. 
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2. Gas gain in a wire chamber. 


What are the effects of the following changes to the gas gain in a proportional wire chamber? 


a) 
b) 
c) 


Increasing the diameter of the anode wire. 
Increasing the gas pressure. 


Increasing the fraction of quencher gas. 


3. Time response of the signal from a proportional chamber. 


a) 


b 


eS 


A particular chamber configuration has an anode wire radius of a = 10 um and a potential 
on the wire of Vp = +3 kV. The cathode is a cylinder of radius b = 10 mm at zero potential, 
concentric with the wire. Assume that the avalanching occurs at a distance 6 = 1 um from 
the anode. Show, using Ramo’s theorem, that the induced charge signal from a charge Q dis- 
placed through a distance dr in the chamber is 


calculate the relative contributions to the charge pulse from the proportional chamber by 
electrons moving to the wire, and by the positive ions moving away. Show that the contri- 
bution of the electrons to the signal is negligible. 
The velocity of the ions is a linear function of the electric field, E, in the tube: 
dr E 
a 


The mobility u is equal to 1.7 cm?s~!V~! for this chamber. Using this relation and the ex- 
pression for dq given above, show that the signal current for the positive ions (as a function 


of time) is e OC 1 
l = 
Are &y(at+d)? ” 
O f+ ne 


and find the total drift time (T) of the ions. Sketch the time development of the current pulse 
(from ions) due to the passage of this particle. Show further that the total charge (from the 
positive ions) collected (after T) is: 


What is the charge collected from the movement of ions for the parameters given above? 
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4. Position resolution in a drift chamber. 


Consider a cylindrical drift chamber. A minimum-ionising particle crossing the chamber creates 
on average N, electron-ion clusters per unit length. The closest cluster to the anode wire is used to 
estimate the perpendicular distance from the track to the anode, de. The radial distance from this 
cluster to the anode is r. 


a) Estimate the resolution in de as a function of r. 


b) Explain qualitatively the effect of diffusion on the resolution for de due to diffusion at large 
values of r. 


c) Why is the contribution to the resolution from the precision of the arrival time measurement 
independent of r? 


. Charge induced on cathode planes. 


A charge located between two cathode planes induces a charge density on the planes, which can be 
found using the method of mirror charges. For simplicity we assume here that the charge is central 
at z = d/2 between two infinite cathode planes at a distance d. 


a) Show that the electric field induced by the charge on the surface of the plane is given by 


2 d 
E;(r)= A y 1)* (2k +1)5 = A 1 
z MEQ fH (2k + pe ay 2ed cosh (=) , 


where r is the distance on the plane from the point of projection of the charge onto the plane. 


b) Find the charge density on the cathode from the field using Gauss’s law. 


. Charge decay. 


Consider a material with a conductivity of o and a relative permittivity €,. Show that any charge 
buildup will decay exponentially with a time constant T = &€,/o. (Hint: use the appropriate 
Maxwell equation in 3D and the microscopic version of Ohm’s law.) 


Evaluate the time constant for copper using o = 1.58 x 1078 Qm and assuming £, = 1. 


8 Liquid detectors 


Liquid detection media are attractive when a detector with large stopping 
power and full sensitivity over a large detector volume is required. Hence 
the main applications for detectors with a liquid detection medium are in 
calorimeters, in neutrino physics and in dark matter searches. The high 
density compared to gases entails large interaction probabilities and high 
energy loss per track length for charged particles. At the same time diffu- 
sion for drifting ionisation charges is low. 

The largest particle detectors today use the most abundant liquid 
on earth, water, exploiting its transparency for the visible photons from 
Cherenkov radiation. We will discuss water Cherenkov detectors in more 
detail in section 12.4. 

For detectors that are not relying on Cherenkov radiation, the most 
widely used liquids in particle detectors are noble liquids because of the 
low intrinsic attachment electron coefficient and/or self-absorption for 
scintillation photons. Of the noble liquids argon is the cheapest and hence 
liquid argon (LAr) has found wide use in particle detectors. 

Liquid xenon (LXe) has a higher atomic mass and higher density, and 
thus a higher stopping power than other noble liquids, and low diffusion. 
In addition, it has properties that make it particularly interesting for detec- 
tors for dark matter searches: It would have a high interaction cross sec- 
tion with weakly interacting massive particles (WIMPs). The only com- 
mon long-lived radioactive contaminant, 85Kr, can be easily removed, and 
natural xenon includes odd-neutron isotopes with non-zero nuclear spin, 
which gives sensitivity also to a spin-dependent WIMP interaction compo- 
nent. 

A liquid detection medium can be circulated, which allows for con- 
tinuous purification of the medium. This is usually a necessity to achieve 
charge or photon transport over the distances required in large detectors, 
but it also can help to achieve radiopurity, if the radioactive contaminant 
can be removed. 

Room-temperature liquids for particle detectors have challenging pu- 
rity requirements, but for example tetramethylsilane (TMS) is being used 
in the KASCADE experiment [409]. 


8.1 CHARGE READOUT 


In the simplest geometry, liquid argon is used to fill the gap in an ionisa- 
tion chamber, and an electrical signal is created by the movement of the 
ionisation charges created in the volume towards the collecting electrodes. 
This is a geometry often used in calorimeters (see chapter 11) with a typi- 
cal drift distance of a few mm to a very few cm. 

This geometry is appropriate for the energy measurement of high- 
energy showers, where the primary goal is the collection of a large frac- 
tion of the created ionisation charge, and not the exact localisation of 
the charge. In neutrino physics experiments we do want to reconstruct 
the tracks of the leptons and any charged hadrons recoiling from the 
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The high density makes liquids less well 
suited for tracking detectors due to mul- 
tiple scattering, while the liquid lacks 
the fine segmentation of the readout that 
semiconductors can provide. 


For review articles on liquid noble 
gas detectors see [190], for liq- 

uid argon [358, 355], and for liquid 
xenon [64]. 


Liquid noble gases are industrially 
obtained from fractional distillation 

of liquid air. Argon is the third-most 
abundant component of air (almost 1%), 
whereas the fraction of xenon in air is 
only around 90 ppb. 


These are often referred to as ‘warm 
liquids’. 
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interaction of the neutrino with the detector medium, and thus a finer posi- 
tion resolution is required. 

The obvious design choice is to increase the segmentation of the col- 
lection electrodes and to increase the drift length, so that a projection of 
the primary ionisation charge is recorded in the plane of the collecting 
electrodes, where a localised signal is induced due to the strong increase 
of the weighting field around the small collection electrodes. If the drift 
time of the primary electrons towards the collection electrode is mea- 
sured, this can be used to determined the distance of the creation of the 
primary ionisation to the detection plane. This is the principle of the liq- 
uid TPC. Compared to a gaseous TPC, liquid TPCs produce significantly 
more primary ionisation charges, and the diffusion is lower. 
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A common detection geometry uses wires to collect the ionisation 
electrons. To provide two-dimensional position information, two planes 
of wires are needed, but to resolve ambiguities a third plane is usually 
added [258]. The plane farthest from the drift volume is kept at the most 
positive potential (usually ground or slightly above), and is called the 
‘collection’ plane. The other wire planes are at a slightly (by a very few 
hundred V) lower potential and are referred to as ‘induction’ planes. As 
the electrons are drifting towards the collection wires, they pass through 
the induction planes, where they induce a bipolar signal on the induction 
wires, as they are drifting first with and then against the weighting field 
for the induction wires. On the collection wires the signal is unipolar. The 
weighting field towards the drift volume is small due to the large distance 
to the drift cathode, and thus the leading lobe of the signal on the first in- 
duction plane will be very small. Therefore sometimes an additional, unin- 
strumented, induction plane is added before the readout planes [110]. The 
scale of the duration of the recorded signals is given by the drift time be- 
tween the wire planes and is typically us, compared to ms for the drift 
time in the drift volume (with a typical drift field of a few hundred V/cm). 

The LAr TPC with the longest continuous operation is part of the Mi- 
croBooNE experiment [14]. Its drift volume is 2.3 x2.6x 10.4 m? filled 
with 90 t of LAr and a drift field of 500 V/cm for a maximum drift time 
of 1.6 ms. It is read out electrically with 8256 wires in three planes at 
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This benefit can only be realised if the 
liquid is pure enough to allow for large 
drift distances. 


Figure 8.1: Schematic drawing of a 
single phase liquid TPC (left) and 
signals in the position/time domain 
(right) [14]. The ionisation electrons 
drift in a large electric field towards 
the anode plane. The relative voltage 
between the wire grids is such that the 
drifting electrons pass through the first 
two planes and are collected on the 
third plane. 


This is referred to as a ‘gate’ by anal- 
ogy to the wire layer in similar location 
in a gaseous TPC, although it does not 
fulfil a gating function (prevention of 
ion back drift into the drift volume), as 
there is no avalanche multiplication. 


The mobility for ions is significantly 
lower, so that it can take minutes until 
the ions are removed from a large 
(O(m)) LAr TPC. 
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+60°/—60°/0° at —200/0/ + 440 V with a wire pitch of 3 mm, and op- 
tically with 32 PMTs with 200 mm diameter. 


NuMZ DATA: RUN 10811, EVENT 2549. APRIL 9, 2017. 


The choice of wires as the electrode in LAr TPCs is driven by the sim- 
plicity with which such electrodes can be assembled into a regular readout 
grid, and the straightforward connection to a readout amplifier, but not 
by the aim to achieve an electric field sufficiently high to cause avalanche 
multiplication. Stable high-gain (>10) avalanche multiplication close to 
a wire in pure noble gases at cryogenic temperatures has proven to be an 
elusive task. However, internal amplification is desirable, and has been 
achieved using similar amplification structures as in MPGDs [173]. 

One geometry making use of such amplification structures is the dual- 
phase TPC, where the drift field is vertical, bringing the electrons close to 
a liquid-vapour interface close to the top of the TPC. An extraction field 
created by an extraction grid within the liquid propels the electrons across 
the liquid-vapour interface. In the gaseous phase the electrons undergo 
avalanche multiplication in planar amplification structures called ‘large 
electron multipliers’ (LEMs) that are made of metalised and perforated 
printed-circuit board (PCB) plates and achieve avalanche multiplication 
inside the holes, similar to a GEM (see section 7.4). 
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8.2 PHOTON DETECTION 


As discussed in section 5.4, the passage of charged particles through cryo- 
genic liquids also results in the emission of long-range scintillation pho- 
tons in the VUV. These photons are prompt as the scintillation usually has 
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Optical readout of scintillation in the 
liquid will be discussed in the next 
section. 


Figure 8.2: Left: The MicroBooNE LAr 
TPC in its cryostat [14]. The drift field 
is vertical with the readout planes on 
the left, and the neutrino beam running 
along the length of the chamber, enter- 
ing from the rear face. Right: Zoomed 
event display from MicroBooNE with 
an electromagnetic shower [372]. The 
colour indicates the magnitude of the 
ionisation signal. 
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75*10"cm* Cu cathode 
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Figure 8.3: Gain in a triple-GEM at 

cryogenic temperatures for different no- 

ble liquids as a function of the voltage 

applied across each GEM [154]. 


Also sometimes called ‘thick gaseous 
electron multipliers’ (THGEM). 


PCBs are used instead of polyimide as 
in GEMs, because their stiffness allows 
for the simple construction of large-area 
amplification structures, and multiple 
scattering material is not a concern. 


Figure 8.4: Vertical cross-section of the 
dual-phase detector as originally pro- 
posed for the DUNE experiment [187]. 


VUV is the acronym for vacuum ultra- 
violet, which covers wavelengths below 
200 nm, where oxygen in the air is 
strongly absorbing. 
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a fast (a few ns) component, and they can thus be used to determine the 
time of the ionisation event. In neutrino experiments this is how the time 
of the interaction in the detector can be determined, which gives the start 
of the drift time of the electrons from the ionisation location to the detec- 
tion plane, which in turn is used to obtain the drift distance. 

Because of the size of typical neutrino experiments an affordable way 
to provide the large detection coverage required is by using PMTs. The 
necessary wavelength shift from the VUV to the visible where the PMT 
sensitivity peaks is often achieved by coating the PMT with tetraphenyl- 
butadiene (TPB). 

An alternative method to achieve large area coverage for the detec- 
tion of scintillation from liquid argon is the ARAPUCA (Argon R&D 
Advanced Program at UniCAmp) device [354]. In this device photons 
in the visible wavelength regime are detected by a silicon photomultiplier 
(SiPM). However, a typical SiPM is relatively small (a very few cm”) and 
to achieve a large acceptance the photons from the scintillation need to be 
guided to this detector. This is achieved by trapping the photons in a box 
with highly reflective walls where they can bounce off several times until 
they hit the detector. The photon is trapped in the box by a dichroic en- 
trance window. The window is coated with two wavelength shifters. On 
the outside, a wavelength shifter is used that shifts the VUV photon from 
the argon scintillation to a wavelength just below the threshold for trans- 
mission of the dichroic window. The photon can pass the window, where 
the wavelength is shifted by the inner wavelength shifter to a wavelength 
above the window threshold, so that the photon is trapped in the box. The 
wall reflection and the window transmission coefficients can be made high 
enough to obtain detection efficiencies of a few % for devices with a cov- 
erage of a few thousand cm?. 

In a dual-phase TPC a second source of photons is electrolumines- 
cence caused by the ionisation electrons in the high electric extraction 
field in the gaseous phase above the liquid. At low pressures (@(mbar,) 
and less) the electrons collide with atoms in the gas, which get excited 
into a P state. In the subsequent de-excitation the atom emits VUV pho- 
tons with wavelengths of about 105 nm and 107 nm in argon, and 130 nm 
and 147 nm in xenon for transitions from the excited singlet and triplet 
states, respectively [190]. At higher pressures formation of diatomic ex- 
cimers becomes more probable and dominates at | bara. These excimers 
emit scintillation similarly to the primary scintillation process in the liq- 
uid, with similar wavelengths. The secondary photon yield is proportional 
to the drift length of the ionisation electron and, depending on E /p (the 
ratio of electric field to pressure), the yield is about several hundred pho- 
tons per cm. As both emissions are in the VUV the same photon detectors 
can be used. 

Readout of secondary scintillation is particularly attractive for dark 
matter searches because the ratio of the primary (from excitation in the 
liquid by the original particle) and secondary (electroluminescence from 
avalanches at the amplification structure in the gas) scintillation response 
is very different for electrons and nuclear recoils, allowing to reduce ra- 
dioactive backgrounds. Further reduction of background from the walls of 
the chambers is achieved by rejecting events in the vicinity of the walls. 


Detectors in Particle Physics 


Silicon photomutltipliers will be dis- 
cussed in section 9.9. 


Lightguides would be too bulky and 
not be efficient because of Liousville’s 
theorem. 


A dichroic window is only transparent 
at wavelengths below a certain thresh- 
old. 


For example, p-terphenyl (PTP) with 
an emission wavelength of 350 nm for a 
window threshold of 400 nm. 

For example, TPB with an emission 
wavelength of 430 nm. 


An improved version is the X- 
ARAPUCA, which uses a light guide 
that is doped with the wavelength shifter 
inside the box, to improve the guidance 
to the photon detector [166]. 


Thus, for a fixed drift length in the ex- 
traction field the secondary scintillation 
signal is proportional to the number of 
ionisation electrons. 


In neutrino detectors it is usually track- 
ing performance over a large volume 
that is required, and thus electronic 
readout is the standard. 
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Several dark matter search experiments of this design with increas- 
ing size have been built. One of them is the LUX-ZEPLIN (LZ) experi- 
ment [34]. It consists of a xenon volume with a maximum drift distance 
of 145.6 cm and a diameter of 145.6 cm with 7 t of LXe in the active vol- 
ume. The vertical drift field in the liquid is 300 V/cm. In the extraction 
region, the electric field strength in the gas is in the range 10.1 kV/cm to 
11.5 kV/cm, sufficient for electroluminescence. The time between the pri- 
mary and the secondary scintillation signal is used as a measure for the 
vertical coordinate of the primary interaction. The horizontal location is 
determined by the pattern of PMT hits. The spatial resolution for events 
at the minimum detector threshold is better than 1 cm and improving with 
increasing energy. 

For the primary scintillation response, the goal is to maximise the 
number of detected VUV photons, as this defines the threshold of the de- 
tector. This is achieved by the use of high quantum efficiency (QE) PMTs 
optimised for the wavelengths of the VUV scintillation photons, and ma- 
terials in the chamber with low photon extinction. For the secondary scin- 
tillation response, the gain of the electroluminescence process makes it 
easier to collect enough photons even at the lowest energies, and the main 
design driver is instead to optimise the spatial resolution, especially for 
peripheral interactions. 


COMBINED READOUT 


The energy resolution in LXe detectors from the scintillation or the ion- 
isation signal is worse than expected from Poisson statistics due to fluc- 
tuations in the electron-ion pair recombination rate. Since recombination 
results in the creation of excited xenon atoms, which subsequently tend to 
form excimers, which decay under emission of a VUV scintillation pho- 
ton, the ionisation and the scintillation signal will be complementary and 
the combination of the two can improve energy resolution. This has been 
demonstrated for 662 keV ys from the decay of !?7Cs [65]. 


8.3 PURITY OF LIQUID DETECTION MEDIA 


The key requirement for the use of liquids as particle detector media 
is their purity, which is primarily required to allow for long-distance 
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Figure 8.5: Schematic view of the work- 
ing principle of a dual-phase TPC with 
photon readout. ‘SI’ is the signal from 
the prompt scintillation photons, ‘S2’ 

is the signal from the electroluminence 
photons created when the ionisation 
charge has reached the high field region 
in the gas layer above the liquid [223]. 


The grid for the ‘gate’ electrode is 

in the liquid and held at a voltage of 
—5.75 kV and the grid for the anode is 
at +5.75 kV. A woven grid with very fine 
pitch and very small diameter wires are 
used to ensure good transparency for 
the scintillation light. 


Peripheral interactions are interactions 
close to the wall, which are rejected to 
reduce backgrounds. 
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transport of either ionisation charges or scintillation photons. The typi- 
cal drift time of ms in a LAr TPC for neutrino experiments sets the scale 
for the required free electron lifetime in the liquid. Drifting electrons can 
be captured by contamination with high electron affinity like O2, H20 or 
CO2. Photons are also captured by N2. Impurity concentrations less than 
a few ten parts per trillion are required to achieve drift paths of several 
meters. 

The use of a fluid medium allows for circulation of the fluid, with the 
possibility to continuously purify the medium. To remove oxygen, the 
fluid is flushed through Oxisorb cartridges and molecular sieves. Activated 
copper-coated alumina granules have also been used, which can be regen- 
erated in an atmosphere containing hydrogen at elevated temperatures. 
With this purification electron lifetimes of 7 ms have been achieved [490]. 

Materials in the detector volumes have to be carefully selected and 
cleaned so that they do not introduce any contaminants. No effect on the 
electron lifetime might be observed when the material is immersed in 
LAr, but when it is positioned in the warmer region of the vapour phase 
(at ~200 K), the electron lifetime can strongly decrease. 

Smaller cryostats (several m°) can be evacuated before filling, but that 
will be impossible for large future neutrino experiments. There the volume 
will be filled first from the bottom with the dense argon gas replacing the 
air, followed by circulation of argon gas at 50 °C, and finally filling and 
circulation with LAr. 
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Figure 8.6: Energy spectra of 662 keV ys 
from "Cs at 1 kV/cm drift field in 
liquid xenon [65]. The top two plots 

are from scintillation (read out with 
PMTs) and ionisation (read out with 
charge-sensitive electronics in a gridded 
ionisation chamber), respectively. The 
charge-light anti-correlation is shown in 
the bottom-right plot. The straight line 
indicates the charge-light correlation 
angle. The left lower plot shows the 
improved energy resolution of the 
combined measurement. 


An empirical relation for the free 
electron lifetime T as a function 

of oxygen-like contamination is 

t/ms ~ 0.3/(N/ppb), where N is 

the concentration of oxygen equivalent 
impurities. 


Oxisorb is a commercial purification 
system that uses chemical absorption 
of oxygen by a silica gel with chromium 
oxide. Moisture is also absorbed physi- 
cally. 
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Key lessons from this chapter 


Detectors based on liquids provide large uniform detection volumes, high stopping 
power, and low diffusion for drifting electrons. 


The most common liquid detector media not relying on Cherenkov radiation detec- 
tion are liquid noble gases. Of these argon is the most affordable, whereas xenon has 
a higher atomic mass and density, and additional features that make it particularly 
suited for dark matter search experiments, but also make it attractive for use in very 
high resolution electromagnetic calorimeters. 


A 3D position information can be measured in a TPC configuration from the signals 
induced on a stack of anode wire planes (for 2-dimensional location in the plane), 
together with a measurement of the drift time for the 3" coordinate. To start the drift 
time measurement the prompt scintillation signal in the liquid can be used. The scin- 
tillation signal is in the VUV. 


While stable avalanche multiplication around wires is not possible in pure cryogenic 
liquids, ionisation electrons can be extracted from the liquid by high electric fields, 
and create secondary scintillation photons in the VUV (electroluminiscence). 


One major challenge in these detectors is purity of the liquid, to prevent loss of ion- 
isation or scintillation signal before detection. The fluidity of the detection medium 
allows for continuous purification. The required purity can also be beneficial in exper- 
iments where contamination can introduce radio-backgrounds. 


EXERCISES 


1. Energy loss in cryogenic liquids. 


Estimate the energy loss dE /dx for a minimum-ionising particle in liquid argon (p = 1.4 g/cm?) 


and for liquid xenon (p = 3.1 g/cm). Estimate (for example from Figure 2.24) the range of a 


1 MeV muon in LAr and LXe. 


2. Event display in a liquid detector. 


Explain the features of the event display in Figure 8.2. How does the energy deposition along the 
charged tracks vary? Can you estimate the energy of the charged particles? Can you estimate the 
energy of the particle causing the electromagnetic shower? 


3. Dual-phase TPC. 
A simple dual-phase argon TPC has a geometry as shown in the figure. 


OV 
vapour 5mm 


5mm 
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a) Calculate the electric field strength in the different regions of the TPC. 

b) What is the polarisation charge density at the liquid-vapour interface? (The relative permittiv- 
ities in the vapour and the liquid are ¢,*? ~ 1 and gï ~ 1.5, respectively.) 

c) Estimate, using Figure 4.8, the maximum drift time for electrons in this TPC. 


d) The speed of sound in liquid argon at —186 °C is about 810 m/s. Compare this to the electron 
drift velocity in this TPC. 


9 Semiconductor detectors 


In solids the stopping power for a charged particle is much higher than in 
gases due to the larger density. Hence, already thin layers of detector give 
sufficiently large signals to detect the impact of a charged particle. As a 
consequence, neither the spacing of primary ionisation clusters, nor diffu- 
sion of the charge carriers poses a practical limitation for the determina- 
tion of the position in a solid detector, and 6-electrons are not a concern, 
because of their shorter range. Another advantage is that the sensor mate- 
rial is itself mechanically stable and does not need a container. 

The key challenge in solid state detectors is the efficient collection of 
the ionisation signal. First and foremost, this requires high purity of the 
material, so that the charges are not trapped locally. Second, the material 
must be chosen so that the charges from the interaction of the incoming 
particle are mobile and thus can create a signal on collecting electrodes 
according to Ramo’s theorem, while at the same time movement of other 
charges, not associated with the passage of a particle, is suppressed. As 
discussed in section 4.3, semiconductors are well suited for this task. 

In this chapter we will focus on silicon detectors, as the most com- 
mon semiconductor detector material. We will first discuss the principles 
of operation of these type of detectors, and discuss the two most common 
implementations, as strip or pixel detectors, which are based on the prin- 
ciple of a reverse-biased pn diode junction. One concern for the operation 
of these type of detectors is the leakage current and its implication for the 
thermal stability of silicon detector systems. 

We will then discuss more advanced silicon detector technologies that 
rely on charge carrier drift, internal amplification, or the capabilities of 
modern semiconductor fabrication technologies. Finally, we will investi- 
gate the effects of high-intensity irradiation on silicon sensors and discuss 
alternative materials for solid detectors. 


9.1 PRINCIPLE OF OPERATION 
INTRINSIC CHARGE CARRIER DENSITY 


While there is a finite gap between the valence band and the conduction 
band in semiconductors (see section 4.3), it is small enough that some 
electrons can be promoted to the conduction band due to thermal motion. 
At the same time electrons promoted to the conduction band leave a hole 
in the valence band. The volume number density of intrinsic carriers of 
energy E is given by n(E) = p(E)F(E), where p(E) is the density of 
states and F (E) is the occupation probability. As electrons are fermions 
this is in principle given by the Fermi-Dirac distribution for electrons, 


1 
exp (s ) +1 


F(E) = 


However, at room temperature kgT ~ 26 meV, and as the Fermi energy is 
in the middle of the band gap, E — Er is a about two orders of magnitude 
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Diffusion can actually be useful, spread- 
ing the signal over several individual 
sense elements, which allows improved 
position resolution from the reconstruc- 
tion of the charge centroid. 


Intrinsic charge carriers are the charge 
carriers present in the conduction band 
in pure silicon. 


The Fermi energy Er is defined as the 
energy at which the occupation prob- 
ability is 50%. This is approximately 

half way between the conduction and 
valence bands. 
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larger than kgT, so that we can describe the occupation probability with a 
Maxwell-Boltzmann distribution, F (E) ~ exp|—(E — Er) /(kpT)]. 

The integrated (over energy) negative and positive (holes) charge vol- 
ume density are given by (see exercise 3) 


(E a ( nti 
Nn = —_ Se I 
a 2mh? . kpT 


2 (meer ( HE: 
Np =2 | —— exp | -———— ], 
á 2mh? P aT 
where Ee, Ep, Ey are the energies at the bottom of the conduction band, the 
Fermi energy and the top of the valence band, respectively, and m} and my, 
the effective masses for electrons and holes. 

For intrinsic semiconductors the density of electrons and holes are 


equal, nn = np = ni, where we have introduced nj, the intrinsic carrier con- 
centration, given by 


(9.1) 
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THE pn JUNCTION 


(9.2) 


At room temperature the intrinsic charge carrier concentration is so high 
that the resulting leakage current will tend to be much larger than the sig- 
nal current created by the passage of a charged particle (see exercise 1). 
Thus, to make a viable silicon detector, we need to remove the mobile car- 
riers from a region of the silicon. This can be achieved by joining n- and 
p-doped silicon. 

In n-doping, small amounts of a period 5 element (e.g. P, As or Sb) are 
used to replace silicon (period 4) atoms in the lattice (the period 5 atom is 
called a “donor’). This leaves an unpaired electron that is only very loosely 
bound, with an energy just below the conduction band. At room tempera- 
ture, these electrons will tend to be promoted to the conduction band and 
thus they are effectively free to move around the lattice if there is an exter- 
nal electric field. 

In p-doping, small amounts of a period 3 element (e.g. B, Al, Ga, In) 
are used to replace silicon atoms (such an atom in the lattice is called 
an ‘acceptor’). The period 3 element only bonds with three electrons to 
neighbouring silicon atoms in the lattice. Therefore a fourth electron can 
be ‘borrowed’ from a nearby silicon atom. This results in a less strongly 
bound state that is just above the top of the valence band. In a similar way 
as for n-doping, at room temperature electrons from the valence band will 
be promoted to this impurity level, resulting in a vacant electron state in 
the valence band. This vacancy can also move under the application of an 
external electric field. We simplify the picture of many electrons moving 
to fill vacancies by interpreting the charged vacancy as a positive ‘hole’, 
Le. itis effectively a free positive charge. 

Mathematically, doping changes the Fermi levels in the two regions by 


(9.3) 
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The motion of an electron (hole) in elec- 
tric and magnetic fields in a semicon- 
ductor can be treated as for a classical 
free particle but with an effective mass 


m, (m3). 


Intrinsic semiconductors are chemically 
pure. 


For pure silicon at room temperature 


ni œ 10! cm. 


As we will see, charge carrier removal 
at a pn junction works for thin de- 
tectors. However, if we want to use 

a semiconductor detector to detect 
gamma rays (~ 1 MeV) with high effi- 
ciency we need a large active volume 
because of the long absorption length. 
There, the only way to reduce intrinsic 
charge carrier density is low temper- 
ature. The finest gamma spectroscopy 
detectors are germanium detectors 

(E; = 0.67 eV) cooled by liquid nitrogen 
(see section 11.1). 


It should be remembered that while 
conceptually appealing, the electrons 
or holes here are not real, standalone 
particles, but really describe a state of 
the lattice (similar to an electron in a 
shell of an atom). 


È 
O | valence 


Figure 9.1: Band structure at a pn 
junction. 
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where na is the acceptor or donor density, respectively, and n; the intrin- 
sic carrier density. p doping brings the Fermi energy closer to the conduc- 
tion band, n doping brings it closer to the valence band. 

At the junction of n- and p-doped silicon the surplus charge carriers in 
each doping region diffuse into the other doping region and neutralise un- 
til thermal equilibrium is reached. As a result, the area around the junction 
gets depleted of charge carriers. The ions remain and create a space charge 
and an electric field. In equilibrium the resulting drift currents cancel the 
diffusion current. 

The depth of the depletion zone can be found from integrating the 
Poisson equation, d°@/dx? = —p /e, where £ = £,£ọ is the permittivity of 
silicon. In a simple model the number of free charge carriers is assumed to 
vanish over the depth of the depletion zone d = dn + dp where the dp,n are 
the depletion depths in the two doped materials, but to be constant outside. 
As a result of the first integration and assuming overall charge neutrality, 
we find dp/dn = Na/Na, where Na (Na) is the acceptor (donor) volume car- 
rier density. The second integration then yields 


2E Na +nq 
e Nang 


d= AV. 


(9.4) 


For an unbiased pn junction, AV is the difference of the Fermi levels in the 
two doping regions (typically a few hundred mV). However, the depth of 
the depletion zone can be increased by applying a DC bias voltage (neg- 
ative pole connected to the p doped, and positive pole to the n doped re- 
gion). This configuration is known as a reverse-biased diode. In that case 
AV = AEp/e + Voias ~ Voias, 1-€. a bias of 100 V will increase the depletion 
depth by more than a factor 10. 

The simplest silicon micro-strip detector geometry consists of a heav- 
ily p-doped (p*) region on a lightly n-doped bulk material. For exam- 
ple, na ~ 10!5 cm~? and na~ 10!? cm~?, so that AEp ~ 0.42 eV and 
dp = 0.02 um and dn = 23 um without bias. With a bias voltage of 100 V 
this becomes dp = 0.4 um and d, = 363 um. In practice the intrinsic bias 
voltage is negligible compared to the applied bias voltage. 

We usually want to collect a fast signal from the full detector depth, 
which requires a large electric field throughout the detector depth, i.e. the 
bias voltage must be at least as large as the depletion voltage. If the bias 
voltage is equal to the depletion voltage, there will be a region of low elec 
tric field and therefore the drift speeds of the charge carriers will be slow 
in these regions. This results in slower signals with ‘tails’ at long times. 
Therefore, in practice we want to operate the detectors with a larger bias 
voltage (‘over-depleteď’) to increase the charge collection speed. The lim- 
ited depletion depth of silicon detectors of the reverse-biased diode type 
limits the usable thickness of such silicon sensors to a few 100 um. 


9.2 FABRICATION OF SILICON DETECTORS 


Nearly all the silicon detectors we will consider require HV to operate. 
We typically need a bias voltage of (100 V). For a detector to work re- 
liably, care must be taken to ensure that all components in the system 
are compatible with this high voltage, including the actual sensors. For 
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At this point the Fermi energy in the two 
regions becomes equal. 


The relative permittivity of silicon is 
independent of doping, €, = 11.7. 
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Figure 9.2: Simple pn junction model. 


The notation n* (p*) refers to very high 
doping concentration for n-type (p-type) 
material. 


The depletion voltage is defined as the 
minimum voltage required to bias the 
full depth of the detector. 


This is studied in exercise 5. 


For extreme radiation levels like at the 
LHC we need higher values of the bias 
voltage, typically O(500 V). 
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example, the design of the sensors must minimise sharp edges where 
the electric field will be enhanced, which can result in local increases 
of the leakage current, called micro-discharges, that can, in the worst 
case, lead to electrical breakdown. The silicon detectors need to be rig- 
orously tested to weed out devices with excess leakage current from HV 
breakdown. 

The production of silicon sensors requires similar steps to those used 
in the electronics industry for the fabrication of CMOS chips. However, 
the minimum feature sizes in silicon detectors are far larger than used in 
modern CMOS chips and the dopant concentration in the bulk has to be 
much lower. The other important difference is that for a sensor the full 
wafer has to be exposed at once as opposed to the CMOS production of 
chips in which a small area is exposed and a step and repeat process is 
used. 

Silicon with the required purity is grown in cylindrical ingots, which 
are sliced into disk-shaped wafers. The surfaces are then lapped and pol- 
ished. The silicon atoms at a bare silicon surface are missing one silicon 
atom and this creates electrically active interface traps (“dangling bonds’). 
These traps will reduce the carrier lifetimes and the charge collection ef- 
ficiency, as well as create surface leakage current [76]. To prevent this, 
sensor manufacture starts with a thermal growth of a thin layer (((1 Um)) 
of silicon dioxide (SiO2) on the surface of the silicon. After this process 
oxygen atoms in this layer saturate most of the traps. A further reduction 
in the trap density can be achieved by adding hydrogen chloride to the 
oxygen [302]. 

As the silicon detectors are typically rectangular in shape they need 
to be cut out (‘diced’) from these wafers, for example using a diamond- 
tipped saw. Whichever technique is used, the cutting will inevitably result 
in ragged edges, potentially resulting in large electric fields that will make 
the edges conductive. Therefore, we need to prevent electric fields at the 
edge of the detectors. For p*-in-n sensors we can achieve this with p* 
‘guard rings’ at the edge of the detector (see also Figure 9.6), which de- 
crease the field strength towards the edge. The outer guard ring has the 
same voltage as the ‘backside’ to minimise leakage current at the edges 
and prevent any conductive path from the high voltage side to the ground 
via the edge of the detector [122]. The drawback of this design is that 
there will be small regions at the edge of the detector which are not biased 
and therefore inefficient (‘dead regions’). 

Strip detectors have readout strips that are usually AC coupled to the 
pre-amplifier. This is done with integrated capacitors using an insulating 
layer. For strip detectors a SiO, layer is often used to create the AC (ca- 
pacitive) coupling of the pt implants to the readout electrodes to isolate 
the amplifiers from the leakage current. 

The contacts are usually made of aluminium, which is connected to 
the readout amplifiers by wedge bonding. Similarly, aluminium is used for 
the bias voltage connection to the backplane, with a layer of n* implant to 
avoid a Schottky barrier and improve the ohmic contact. 

The patterning to create the detector structures uses photolithogra- 
phy (as used for the fabrication of CMOS chips). The desired material 
is applied to the surface of the silicon and covered by a photoresist. The 
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Breakdown occurs when the leakage 
current becomes too large even for 
small voltages, and exceeds the capabil- 
ities of the power supply. 


For a more detailed discussion of the 
process steps required, see [464, 316, 


435]. i 
The starting material for silicon detec- 


tors has to be high resistivity silicon 
(see exercise 11). 


Silicon detectors have been produced by 
specialist manufacturers. In section 9.10 
we will also consider very different 
designs of silicon detectors based on 
CMOS imagers. CMOS imagers are 
used in cameras on nearly all mobile 
phones. They are widely used in indus- 
try and in medical applications. Their 
use in astrophysics is also growing. 


This process is called ‘passivation’. 


The surface leakage current of the sen- 
sors is very dependent on the quality of 
these processes. Even with passivation 
silicon sensors are very sensitive to 
mechanical damage. A surface scratch 
can result in electrical shorts. Shorts 
can also be created by chemical con- 
tamination. Bond pads need to be kept 
very clean for wire bonding. Therefore 
vacuum pickup tools are used and the 
sensors are kept in clean rooms. 


The guard rings are typically floating, 
and acquire an appropriate potential 
by ‘punch-through’. The potential of 
the floating implant adjusts so that it’s 
just enough that the depletion zone from 
the adjacent (biased) implant extends 

to it and the potential difference is high 
enough to make charge carriers flow 
from the floating implant to the adjacent 
(biased) implant. 


A Schottky contact is a contact between 
a metal and a semiconductor. It results 
in a barrier for charge carriers due 

to the continuity of the Fermi levels 
between the two materials. The height 
of the barrier depends on the doping 
concentration in the semiconductor. 
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photoresist is exposed by a UV source through a mask and then the ex- 
posed (or non-exposed) photoresist is removed chemically. Chemical etch- 
ing is used to remove the material in the gaps of the mask, thus producing 
the desired structure. This process is used for 


e aluminium for surface electrodes used for connections by wedge 
bonding; 


e polysilicon for creating resistors; 
e silicon nitride (or silicon oxide) to create coupling capacitors; 


e silicon nitride (or polyimide or silicon oxide) used to protect the 
sensors from mechanical or chemical damage. 


For silicon detectors we need to introduce selective p or n-type doping 
into the high purity silicon. A common way to do this is to use electro- 
static acceleration of suitable ions (e.g. B for p-type doping) to inject the 
ions into the silicon. The range of the ions in silicon varies with energy. 
Therefore the doping profile can be adjusted by varying the energy of the 
ions. The mask thickness for these steps is chosen to stop all the ions so 
that only the desired regions are doped [435]. 

One disadvantage of the p*-in-n approach is that most of the signal 
comes from drifting holes rather than electrons (see exercise 5). As dis- 
cussed in section 4.3 the mobility of holes is lower than for electrons, 
while the carrier lifetimes are similar for electrons and holes. Hence, as 
holes spend more time drifting than electrons, they are more likely to 
be trapped during their passage through the bulk (see section 9.11). For 
n*-in-n or n*-in-p sensors most of the signal comes from the drifting 
electrons and these types of sensors are therefore more radiation tolerant 
than p*-in-n sensors. 

Ionising radiation will create electron-hole pairs in the oxide. The high 
mobility electrons will drift and be collected by a positively charged elec- 
trode. However, the low-mobility holes move only until they reach deep 
traps at the Si/SiO2 junction. After further irradiation, this creates an elec- 
tron accumulation layer. This causes one issue with n* implants in that 
the electron accumulation layer will result in the n* implants becoming 
electrically connected (at the level of kQ). This can be avoided by intro- 
ducing p-stops or p-spray between the n* implants. For p™-in-n detectors, 
the accumulation will improve the inter-strip isolation and therefore the 
additional processing is not required. 
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Figure 9.3: Conceptual cross-section 
(perpendicular to the length of the 
readout strips) through an AC-coupled 
p-in-n strip sensor (not to scale). The 
bias voltage is applied to the aluminium 
backplane. 


A photoresist is a light-sensitive poly- 
mer that becomes soluble when exposed 
to UV light. 


Wedge bonding is a standard indus- 
trial process that typically uses 25 um 
thick aluminium wire for the electrical 
connection. The wire is connected to 

the aluminium electrodes by applying a 
pulse of ultrasonic energy and pressure 
which results in a welded connection. 
Wedge bonding is done using standard 
machines from the semiconductor indus- 
try, which use a semi-automated process 
to achieve a high throughput. 


Additional layers to the SiOz passi- 
vation layer are needed for practical 
sensors. 


The difference between p-stops and 
p-spray is the dose and depth of the 
doping. p-stops are made using ion 
implantation with a mask. p-spray 
is typically deposited at lower dose 
and does not require the use of an 
additional mask. It thus requires one 
less processing step. 
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Figure 9.4: Conceptual cross-section 
(perpendicular to the readout strips) 


through an AC-coupled n-in-p strip 


j Al l Al i Al } Al sensor with p-stops (not to scale). 
SiO, 


Al 


bias 


Again, to avoid excessive leakage currents or breakdown at the edges 
of the sensor guard rings are employed. For n*-in-p they again need to 
be on the side of the readout n* implants, but for n*-in-n they are also 
needed on the backplane, requiring additional processing steps for the sen- 
sor and thus increasing the cost [192]. 

For the first generation of the LHC silicon detectors the choice was 
to use the cheaper p*-in-n sensors for the large area micro-strip detectors 
and the more radiation tolerant and expensive n*-in-n for the smaller pixel 
detectors that need to survive higher fluences. However, for the upgrades 
for the HL-LHC, even the micro-strip detectors require better radiation 


tolerance and therefore, n*-in-p sensors will be used for both the micro- As discussed, n* -in-p sensors have 
š 3 better radiation tolerance than p* -in-n 
strip and the pixel detectors there. sensors but do not require double-sided 


processing and are therefore cheaper 
than n* -in-n sensors. 


9.3 SILICON STRIP DETECTORS 


In silicon strip detectors the readout is segmented into strips that are typi- 
cally of the order of 100 um wide and several cm long. This design aims 
for good position resolution perpendicular to the strips while limiting the 
overall number of readout channels. The drawback of this geometry is that 
it does not give useful information about the location of hits along the di- Exercise 2 investigates how the position 
f . resolution in the direction of the strips 
rection of the strip. can be improved. 
In a strip sensor the aluminium readout strips are connected to the 
readout ASICs using wedge bonding. The width of these bonds makes it ASIC is the acronym for application- 
i i . = specific integrated circuit, a custom- 
difficult to reduce the pitch below ~50 um on the width of the strips that designed electronic chip. 
can be used. 
The connection of the bias voltage requires very good filtering close to Pi will typically be some type of RC 
š k è g $ er. 
the detector as we use single-ended amplifiers as discussed in section 3.3, 
which makes the readout very sensitive to noise on the high voltage (HV). 


For this type of single-sided detector the HV is connected directly to the 


backplane (i.e. the opposite side to the junction) of the detector. The aluminium layer allows easy elec- 

$ trical connection to the detector (using 

For AC coupled detectors, a DC conductive path for the leakage cur- either conductive glue or wedge bond- 
rent is required, which is provided by a bias resistor. As each strip needs ing). 


such a resistor it is not feasible to use discrete components, so the resistor 2 Foo ie oe detector has 
. . “4: . . channels. 
has to be built into the silicon sensor. A common way to do this is using 


polysilicon resistors which have sheet resistances of (100 kQ/O)!. To Polysilicon is a high purity polycrys- 
talline form of silicon. 


‘Sheet resistance is the resistance of a square piece of a thin material with contacts made 
to two opposite sides of the square. It is common to use the unit Q/U or ‘ohms per square’, 
which is dimensionally equal to an ohm, but is exclusively used for sheet resistance. It is 
obtained when the resistivity of the material (in Qm?/m) is divided by the thickness of the 
conductor (in m). 
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achieve good coupling of the signal into the amplifier the resistance needs 
to be much larger than the input impedance of the amplifier at signal fre- 
quencies. We also want to minimise the thermal noise in the resistor (see 
eq. (3.5)). However, too large a resistance would result in excess voltage 
drop and power consumption. Typically a resistance of @(1 MQ) is used. 
This requires a very high ratio of length to width, which is achieved with a 
long meander. 
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SILICON DETECTOR SIGNALS 


As the depleted region is not very deep, the duration of the signals is short 
(of order 10 ns), and the readout electronics is usually not fast enough to 
resolve the time structure of the signal. 

The drift field in a silicon detector resembles the field in an ionisation 
detector as discussed in section 3.1 over most of the volume, apart from 
the region around the strip implant, where the density of field lines in- 
creases. The weighting field for one strip is more complex, as it extends 
into the volume depleted by the adjacent strips. 

As a consequence, a charge induces a current signal on the strip on 
which it will arrive that is unipolar and that increases as the charge gets 
closer to the strip. On the other hand, a mobile charge (electron or hole) 
liberated close to the backplane in the volume covered by an adjacent strip 
will first create a similar signal, but as the charge is approaching the ad- 
jacent strip the polarity of the induced current signal will reverse, as the 
drift and the weighting fields become anti-parallel (see Figure 9.7). The 


net charge induced in the first case if the drifting charge is starting close to 
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It is difficult to achieve a very precise 
value of the resistance. 


Figure 9.5: Photograph of a polysilicon 
resistor from a CMS sensor for the 
HL-LHC upgrade [319]. 


Figure 9.6: Schematic cross-section 
(along a readout strip) showing the 
edge region of one channel of a silicon 
microstrip detector [319]. The three 
regions of SiOz shown are grown at 
different stages of the fabrication. 


One reason to keep the drifttime short 
is that this reduces the likelihood of 
charge trapping, and thus improves the 
charge collection efficiency. 


150 


| 


> 
Seal od 
EEES fo] 


bad 
o 
R 


ò 
get 
& 


d 
£86 
5 
Sg 


Vo = 1 
Xx Distance from strip plane 


the backplane will equal the drifting charge, whereas it will be zero in the 
latter. 

As in an ionisation event electrons and holes are produced together 
both contribute to the signal. The dominant contribution comes from the 
type of charge carrier that drifts to the implant strip, because of the high 
value of the weighting field there, i.e. holes in the case of p* strips and 
electrons for n* implants. Therefore, the time structure is affected by the 
different mobilities for the two types of charge carriers. 
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The time response of a simple silicon pad detector to minimum- 
ionising particles for different bias voltages is considered in exercise 5. 


ELECTRONIC READOUT 


We have discussed the optimisation of the amplifier readout for low noise 
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Figure 9.7: Left: Weighting potential in 
a strip detector (strip width and strip 
separation are 1/6 of the anode/cathode 
distance). Right: Vertical component of 
the weighting field of the strip system on 
the left along the three lines indicated. 


An experimental technique to study 
properties of silicon detectors like 

the field distribution, the efficiency of 
charge collection, the drift velocity 
and the lifetime of charge carriers, is 
the transient current technique (TCT), 
which is the measurement of time- 
resolved current pulse shapes in silicon 
detectors induced by laser light pulses 
(with wavelengths of 660 or 1064 nm) 
of sub-nanosecond duration and a pulse 
power corresponding to from a few to 
about 100 minimum ionising particles 
(see for example [318]). 


Figure 9.8: Strip detector signals for an 
p-in-n device with segmented strips on 
the p and n-sides of the sensor with 60 V 
depletion voltage operated at a bias 
voltage at 90 V for a uniform ionisation 
density in the sensor [465]. Shown are 
the electron (e) and hole contributions 
(h), and the total signal (thick lines). 


in section 3.6. As shown there, a cascode amplifier has the benefits of high 
gain and high bandwidth, and is therefore well suited for the pre-amplifier 
in a silicon system. 
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In silicon detectors it is common not to use a standard differential ar- 
chitecture of the amplifier chain, but single-ended inputs. The major ad- 
vantage is lower power for the same noise performance. The fundamental 
noise for a differential amplifier is v2 times larger than for an equivalent 
single-ended amplifier (we assume that there is no correlation in the noise 
for the two inputs). Therefore, to achieve the equivalent noise performance 
we would need to increase the current for each transistor by a factor of 
two. The resulting differential amplifier would need 4 times the current as 
for the equivalent single-ended transistor. 

The drawback of the single-ended approach is that the amplifier is 
more susceptible to interference noise, such as noise from the power sup- 
ply. This is quantified by the Power Supply Rejection Ratio (PSRR), 


Power Supply Variation 
Input Offset Voltage Variation / ` 


PSRR = 20log ( (9.5) 
The value of the PSRR quantifies how supply power fluctuations create a 
change in the input offset voltage. It is essential to use a suitably low-noise 


power supply and to have low-pass filtering on the power lines near the 
amplifier ASIC. 


DATA TRANSFER 


After suitable amplification there are three options for a silicon detector 
system for how to process the data on the detector and transmit it to off- 
detector data acquisition systems: 


e Binary: For each detector element a digital signal with two possible 
states, logical ‘0’ (no-hit) or ‘l’ (hit), is produced. 


e Digital: The signal from the output of each amplifier is digitised 
with more than one bit, and the digital output is transmitted. 


e Analogue: The amplitude of the pulse is converted to a proportional 
optical signal that is transmitted over optical fibre to the off-detector 
data acquisition system where it is digitised. 


The first option is the simplest. In the most naive version the output 
of the amplifier for each detector element (strip or pixel) is compared to a 
common discriminator level for all channels in one front-end ASIC. The 
problem is that there are usually significant channel-to-channel gain varia- 
tions and these need correcting in order not to degrade the effective S/N. 

For binary readout, the discriminator threshold is typically set by a 
Digital-to-Analogue Converter (DAC). It is not practical to have an inde- 
pendent DAC with the required number of bits (i.e. dynamic range) for 
each element. This problem can be solved by having a global DAC for 
each ASIC with the required number of bits and adding a ‘Trim DAC’ for 
every channel with a limited number of bits that are sufficient to compen- 
sate for the channel-to-channel variations. 

As we have seen in chapter 3, there is a trade-off between speed, 
power and S/N. For the case of LHC detectors, there is an irreducible 
background of signals from additional proton-proton collisions in the 
triggered bunch crossing (called ‘in-time pileup’). However, if the pulse 
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The noise scales as \/1/gm and the 
transconductance gm scales like \/Tq. 


This expression gives the PSRR in dB. 


The input offset voltage of an amplifier 
is the voltage that must be applied 

between the two input terminals of the 
amplifier to obtain zero output voltage. 


The output is typically the result of a 
comparison with a threshold. 


This architecture is used extensively in 
fields where amplitude information is 
required, e.g. in spectroscopy. 


Crucially, it is also the cheapest option 
and has been adopted by both ATLAS 
and CMS for the strip tracker upgrades 
for the HL-LHC. 


Front-end refers to the electronics 

on the detector, in this case directly 
coupled to the silicon sensors. Back-end 
refers to the electronics in a ‘counting 
room’ which will be @(100 m) away 
from the detector and not be subject to 
radiation. 


For example the readout ASIC for the 
ATLAS SemiConductor Tracker (SCT) 
has an 8-bit DAC and a 4-bit-trim 
DAC [177]. 
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shaping is too slow, there will be genuine signals from the wrong bunch 
crossing (‘out-of-time pileup’). 

One strategy used at LHC to minimise the power consumption and 
achieve the required performance is to allow the pulse-shaping to be such 
that signals extend over two bunch crossings. However, only strips which 
have a hit in the triggered bunch crossing and no hit in the preceding 
bunch crossing are read out into the buffer. 

The second option, digital readout, requires a low power ADC for ev- 
ery channel. It enables the measurement of variations in the baseline level 
across a sensor/amplifier and thus slow drifts in time can be measured and 
corrected for. However, this option increases the power consumption and 
complexity for the ASIC compared to simple binary readout. The result- 
ing increase in chip area and the higher bandwidth required for the data 
transmission to the counting room increase the costs. 

The third option, analogue readout, which for example was used by 
CMS for the first generation of their silicon tracker, is attractive in that 
it minimises on-detector electronics and retains the possibility of cor- 
recting for baseline fluctuations in the off-detector receiver electronics. 

In the CMS case the analogue data is transmitted at 40 Msamples/s with 
an equivalent resolution of 8 bits. The data transmission requires ana- 
logue optical links with excellent linearity. While this system works well, 
the two key disadvantages with respect to the binary option are that the 
edge emitting lasers (EELs) used for the transmission are not as radiation- 
tolerant as the vertical cavity surface emitting lasers (VCSELs) commonly 
used for transmitting digital data and that it requires a much larger number 
of optical links. 

Digital or analogue readout allow to calculate the charge-weighted 
centroid of a cluster of strips. This will result in an improved spatial reso- 
lution compared to that of the simple binary readout. 


PIPELINE AND BUFFER 


In applications with low channel counts and low rates each detector chan- 
nel can simply be connected to a data transmission channel. However, at 
hadron colliders such as LHC the events are acquired at the beam bunch 
crossing rate of 40 MHz and it is generally not practical to read out the de- 
tectors at this rate. A trigger system is therefore required to reduce the data 
rate. The first trigger level, which is typically implemented in hardware, 
requires several us to make a decision as to whether to accept or reject an 
event corresponding to a particular bunch crossing. Therefore the data are 
stored on the detector in a ‘pipeline’ memory while the trigger decision is 
being made. 

A pipeline memory can be implemented as a circular memory. For 
the case of LHC detectors, data corresponding to each bunch crossing are 
written into a different cell, i.e. the write pointer advances by one cell ev- 
ery bunch crossing. If the depth of the pipeline is N, then after N bunch 
crossings new data is written into the cells. The external hardware trigger 
must decide whether to keep or reject the data from every bunch crossing. 
Clearly this decision must be made in less than a period equal to N times 
the spacing between bunch crossings. 
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A buffer in this context means a memory 
used to temporarily store data before 
the data is transferred to another lo- 
cation. Therefore the use of the slower 
and lower power pre-amplifier does not 
increase the ‘pile-up’ from the previous 
bunch crossings. At high luminosity 

it will cause a small decrease in hit 
efficiency, O(1%). 


Good linearity enables the use of a 
simple calibration scheme. Electrical 
data links with this bandwidth would 
not be practical. This was achieved 
using EELs operating at 1310 nm and 
single-mode fibre. 

VCSELs operate at 850 nm and are 
coupled to multi-mode fibres. 


For a discussion of triggers see chap- 
ter 13. 


For the LHC bunch crossing frequency 
of 40 MHz a typical pipeline depth 

N = 128 gives 3.2 us for a trigger 
decision. 
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If the data from a particular bunch crossing is accepted by the trigger, 
this bunch crossing is flagged and the data are not overwritten. The corre- 
sponding data are read out into a “de-randomising’ buffer for further pro- 
cessing and readout to the off-detector electronics. For the case of binary 
or digital data, this processing will involve ‘sparsification’, i.e. only read- 
ing out hit cells. In addition to further compress the data, neighbouring 
hits are gathered into clusters and the cluster information rather than the 
data from individual elements is read out. For a binary system this does 
not represent a loss of data, it is just a more efficient representation. 


9.4 CALIBRATION OF SILICON STRIP DETECTORS 


In large particle detector systems with millions of channels it is essential 
to determine optimised settings for the operational parameters (thresholds 
etc.) for each channel with an appropriate calibration system. These sys- 
tems usually have too many channels to calibrate with sources or particle 
beams and therefore the calibration circuits are integrated into the readout 
ASICs. 

Calibration is required to ensure the full functionality of all modules 
before installation in the overall detector. In addition, there will be vari- 
ations in the performance of the modules over time, due to slow drifts in 
the electronics, for example due to changes in environmental conditions 
(typically temperature). If the detector is exposed to significant radiation 
(e.g. in the LHC detectors) the performance of the sensors and associated 
front-end electronics will be degraded over time. It is essential to track 
these changes and to adjust operational parameters. In principle this could 
also be achieved using data with and without beam present, but this would 
take far too long and would waste valuable beam time. Therefore built-in 
electronic calibration systems are used. 

An accurate calibration is required for all channels of a silicon de- 
tector to adjust for channel-to-channel variations in the sensor and elec- 
tronics. Calibration systems are required to measure the gain and noise 
for all channels as well as, where possible, the response with no signals 
(‘pedestals’). To know that a module is working and that the operational 


parameters are set correctly, we need to be able to measure the response to 


a known input charge to confirm a high efficiency for detection of charged 
particles. We also need to be be able to measure the ENC in absolute 
units (fC or number of electrons) to verify a satisfactory S/N. An accu- 
rate calibration is required for all channels of a silicon detector to adjust 
for channel-to-channel variations in the sensor and electronics. 

We will discuss the calibration of analogue and binary systems. The 
calibration procedures are different for silicon strip systems with analogue 
or binary readout. 


CALIBRATION OF ANALOGUE SYSTEMS 


We will consider the calibration of the CMS silicon tracker [196] as an 
example of the calibration procedures required for an analogue system. 
The CMS silicon detectors use analogue optical links to transfer the data 
from the output of the front-end ASIC (APV25) to the back-end elec- 
tronics in the counting room. There the signals are digitised at a rate of 
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In general, arrival times of data at 
the processing facility are random. 
Without buffers, the processors remain 
idle after reading out one event while 
they wait for the next, and the system 
is inefficient. Adding buffers equalises 
the rate with which data arrives, with 
the result that the processors are kept 
busy nearly full-time, as if they were 
presented with a continuous stream of 
events matched to the processing rate 
(‘de-randomised’). See also [295]. 


The ATLAS and CMS silicon strip 
trackers have G(10") channels and the 
pixel detectors have 6(10°) channels. 


By a module we typically mean a silicon 
sensor and the associated readout 
circuitry. 


The electronic calibrations can be 
performed in periods without beam and 
therefore do not waste any beam time. 


An example of a calibration proce- 
dure for a digital system is described 
in [193]. 
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40 Msamples/s, with an 8-bit resolution. Baseline levels with no signal 
(these are called ‘pedestals’) are measured for every strip using data with 
no beam. These pedestal values are subtracted from the measured data 
during physics operation for each channel. In addition, the median level of 
the ADC values for all the channels in each ASIC is measured and used to 
correct for common-mode noise. 

The next aspect to the calibration is to determine the overall gain. 
This depends on the optical system as well as the APV25 and sensors. 
For this purpose, each event readout contains a ‘tick mark’ at the end of 
the analogue data that corresponds to a large and known charge, see Fig- 
ure 9.9. The gain is measured using the ADC response to the tick mark af- 
ter pedestal subtraction. The variation in gain calibration is further reduced 
using the measured pulse heights of clusters on tracks from collision data. 
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This approach was validated at test beams using particles of known 
type and energy. The final validation uses in-situ data, for example cosmic 
ray muons (before the LHC started operation). The S/N of clusters of hits 
on reconstructed tracks is consistent with expectations. 


g r . ye Indf 73.65/78 
= p if Width 1.903 = 0.033 
c 1000F- it MP 26.91 + 0.04 
LW T i" | Area 1.464e+04 + 91 
I 4 GSigma 2.675 + 0.067 

800- I| 7 

C ? i CMS 2008 |] 

L | 4 J 

i 

600- if \ eal 

L ° } J 

L ; } J 

l \ 4 

| 

400- l i a 

H | s q 

[ l y ] 

l o 

200- ] % 7 

L d J 

L è 4 

Olana dordar 


O 10 20 30 40 50 60 70 80 90 
Signal-to-noise 


Detectors in Particle Physics 


Common-mode noise is coherent noise 
which has the same value for all chan- 
nels in a given event (see section 3.2). 
It can vary from event to event and one 
of the attractive features of the ana- 
logue readout is that it allows for this 
common-mode noise correction. 


Figure 9.9: APV25 raw data in ADC 
counts, showing a digital header, 
analogue data and a tick mark at the 
end. The analogue data corresponds to 
the multiplexed data for one triggered 
event from two ASICs, with one channel 
carrying a signal from a minimum- 
ionising particle (MIP) [197]. 


By in-situ data we refer to data that is 
collected during operation of the full 
detector. 


Figure 9.10: Distribution of S/N for 
clusters in the innermost barrel strip 
detector layer of the CMS tracker [197]. 
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CALIBRATION OF BINARY SYSTEMS 


Here we will consider the calibration system of the ATLAS ITk strip de- 
tector [404]. The calibration of binary readout systems is not as straight- 
forward as for analogue systems. In a binary system, we can measure 

the ratio of the number of hit strips over the total number of strips (this 

is called the ‘occupancy’) for a given value of the discriminator thresh- 
old. If the noise has a Gaussian distribution, the variation of occupancy 
with threshold will be of the form of an error function. From a fit of the 
occupancy as a function of threshold we can determine two parameters: 
the value of the threshold at which the occupancy is equal to 50% (called 
VT50 in ATLAS) and the o of the Gaussian noise distribution. 
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This scan is repeated for different values of known input charge, Q;, 
and the corresponding values of VT50; and o; are determined from an 
error function fit for each of the scans. The gain at an input charge of q; 
can then be determined as G; = VT50;/Q;. In general this relation will not 
be linear. 

The values of the Gaussian 0; give an estimate of the noise at the out- 
put of the amplifier. With the gain obtained from the calibration, the input 
noise can be found from Ojnput,i = O; /Gi. 

Once the gain and input noise are known, the discriminator threshold 
can be set low enough to give a high efficiency for MIPs, but large enough 
to be far enough above the noise. 

The only way to be sure that these complex calibration procedures are 
correct is to study the response to charged particles. Therefore, the per- 
formance of modules is validated in test beams in which modules are ex- 
posed to particles of known energy. An example is shown for an ATLAS 
SCT module in Figure 9.12. The plots show the hit efficiency (for particle 
detection) and noise occupancy as a function of threshold for an unirra- 
diated module [176]. The results show a wide range of threshold values 
which achieve a hit efficiency above 99% while maintaining a noise occu- 
pancy below 1074. 
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Ina binary system we lack the data to 
determine a pedestal or the response 
to a known input charge from the mea- 
sured ADC values. 


The error function is the integral of the 
Gaussian distribution. 


Figure 9.11: Simulated scan of occu- 
pancy as a function of discriminator 
threshold and the error function fit for 
the ATLAS ITK strip system [404]. 


The charge is injected by a fixed voltage 
over a capacitor of known value in the 
front-end ASIC. 


The amplifier is designed to give a 
linear response for typical signals 
expected from MIPs, but to have lower 
gain for larger charges. This results in a 
lower power consumption compared to 
an amplifier that is linear over the full 
range. 


This calibration procedure needs to be 
performed on all channels. Therefore 
each channel has an adjustable dis- 
criminator threshold and its own charge 
injection circuitry. 
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This validates the intrinsic module performance and the calibration 
systems. 
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9.5 LEAKAGE CURRENT 


The signal in silicon detectors is the current induced by the charge carriers 
that have been promoted to the conduction band or created in the valence 
band by the interaction with the passing charged particle. However, even 
in the absence of such an interaction and even if the pn junction is fully 
depleted, a current will flow due to thermally created electron/hole pairs. 
Leakage current will be present in all silicon detectors, but it increases 
significantly after irradiation. 

We consider fully depleted detectors, i.e. we have used an electric field 
to remove the mobile carriers. The leakage current arises from thermal 
energy promoting electrons from the valence band to the conduction band, 
as any electrons in the conduction band will drift under the influence of 
the electric field. 

Again, we will focus the discussion on silicon. Crystalline silicon is an 
indirect band gap semiconductor. Hence transitions of electrons from the 
top of the valence band to the bottom of the conduction band are highly 
suppressed by the need to simultaneously conserve energy and momen- 
tum. Therefore the dominant mechanism for electrons to be promoted to 
the conduction band is via energy levels within the band gap. Such energy 
levels will exist because of defects in the silicon crystal but they are also 
created by radiation damage. 


GENERATION-RECOMBINATION THEORY 


In this section we will first consider the steady state achieved when 
charges are continuously injected into the conduction band. 

There are four processes involving transitions from the conduction or 
valence band to/from trap centres (energy levels in the forbidden gap): 


e Emission of negative carriers (electrons) from a trap centre to the 
conduction band with rate r7; 


e Capture of negative carriers (electrons) by a trap centre from the 
conduction band with rate rt; 
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A similar study was performed for an 
irradiated module and that showed the 
expected degradation from radiation 
damage to the sensor and ASIC. How- 
ever, it was still possible to find a suit- 
able operating window (i.e. threshold 
setting) in order to achieve the required 
low noise and high hit efficiency [176]. 


Figure 9.12: Plots of efficiency versus 
threshold (upper data points and left 
hand axis) and occupancy versus 
threshold (lower data points and right 
hand axis) for an unirradiated SCT 
module [176]. 


This current is called ‘leakage current’ 
or ‘reverse bias current’. 
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Such intermediate states are called 
‘traps’. 


For example, this could be done by 
shining laser light into the silicon. The 
theory was first worked out in [277] and 
[456]. Our discussion is based on the 
textbooks [316], [349] and [464]. 
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e Emission of positive carriers (holes) from a trap centre to the con- 
duction band with rate 72; 


e Capture of positive carriers (holes) by a trap centre from the con- 
duction band with rate r?. 


The rates for these four processes will be proportional to the number of 
available states. For capture of negative carriers by a trap centre, this in- 
volves the volume number density of negative charges, nn, the volume 
number density of trap centres, nų, and the occupation probability for 
the trap, f. The volume number density of the available trap centres is 
n(1—f). 

Capture of positive carriers by a trap centre is equivalent to the emis- 
sion of a negative carrier from the trap centre. The rate is therefore propor- 
tional to the number of occupied trap centres m,f and the volume number 


density of positive carriers, np. Therefore we can write down the capture The occupation probability is given by 
a Fermi-Dirac distribution, however 


rates per unit volume of negative and positive charges as we can safely approximate this by the 
Maxwell-Boltzmann distribution as in 
n section 9.1. 
re =Cynym(1—f) and rẹ =cpnpmf, (9.6) 


where c, and cp are constants given by the product of the speed v and the 
appropriate capture cross-sections, 


d 
Cn =f T odv, (9.7) 
o dv 


where dn /dv is the distribution of speeds, with the equivalent definition 
for cp. 

We can write down the equivalent formulae for the emission probabili- 
ties of negative or positive carriers 


re = ennf and rẹ =epnpn(1—f), (9.8) 


where the emission probabilities e„ and e, are constants that we can deter- 

mine by considering the situation without charge injection, i.e. in thermal 

equilibrium. In this case the rates of electron (hole) capture and electron 

(hole) emission must be identical. This is the principle of detailed balance. 
In n-doped silicon, we can write the negative carrier density in terms 

of the intrinsic carrier density n; as 


Hak 
n= ni exp (57) , (9.9) 


where Ep (Ep) is the Fermi level for intrinsic (doped) silicon (see exer- 
cise 4). Combining this result with equations 9.6 and 9.8 gives relations 
between emission probability and capture coefficients, 


Et- Er 
En = Cyl EXP | — kT 


E — Eg 
ep = Cpi €Xp aT 7 


(9.10) 


When we are injecting charge at a rate Rinj we no longer have thermal 
equilibrium but there is still a steady state, i.e. the rate at which electrons 
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are transferred to the conduction band is equal to the rate at which they 
leave the conduction band (with an equivalent condition for the positive 
holes), 


non 
Rinj +e — re = 0, 


9.11 
Rig trle =r =0, on 


We can combine the two expressions in eq. (9.11) and substitute from 
equations (9.6) and (9.7) to solve for the occupation probability f in non- 
thermal equilibrium 


Catnn(1— f) —entamf =cpnprif —epnpm(1—f), 


Ep Ny +Cpn 
f= Pop . (9.12) 
CnNn + CpNp + ennn + CpNp 
We can now calculate the net recombination rate, Gi =r? — r}, 
G;/m% = Cnn — f (Cnn T enn). (9.13) 


Substituting for f from eq. (9.12) gives 


Gi Cnn (Cnn F CpNp F Eennn + epnp) a (epnp ag CnMn) (Cnn 2g ennn) T 
ny CnNn + CpNp + Ennn +e pNp 
CpCaNnNp — CpenMnNp 


= ; (9.14) 
CnNn + CpNp + ennn + CpNp 
Eliminating e, and e, using eq. (9.10) yields 
2 
Gi = CpCn (nnp = ) (9.15) 


no E-E B-BN\ 
t Cn (nn + miexp ( cet) ) + Cp (np t niexp ( ce) 


We can simplify this result if we make the approximation cn ~ cp, and 
we can approximate the result for c, from equation 9.7 as cy, ~ vo where v 
is the mean thermal velocity and o the effective cross-section, 
Nnnp — n? 


E-E \ 
Nn + Mp + 2ni cosh ( ir) 


G = Ovn (9.16) 


We can now combine the results from sections 9.1 and 9.5 to deter- 
mine the variation of leakage current with temperature. We consider the 
situation of a region of the detector that is fully depleted of charge car- 
riers (e.g. by applying a reverse bias voltage to a pn junction). Initially, 

Ny = Np = 0 and we can calculate the rate of creation of carriers (electrons 
and holes will be created in equal numbers) per unit volume, 


d. š 
T oyim” (9.17) 
dt 2cosh Gas ) 
The leakage current in a volume V is 
dn 
Teak = =~ eV, (9.18) 


dt 


Detectors in Particle Physics 


In thermal equilibrium the recombi- 
nation rate is equal to the generation 
rate for negative and positive charges 
ri=riandre=re. 
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and substituting from eq. (9.17) 


ni 
Ee—Et 
2cosh ( aor) 


We can see that the leakage current will be dominated by trap levels 
with an energy F; close to the Fermi energy Er. If we assume that Ef ~ 
E,, then the temperature dependence of the leakage current variation with 
temperature is given by the variation of n; (eq. (9.2)) and v (« VT). 

With these approximations we find for the dependence of the leakage 
current (Eg is the band gap energy) 


In case the trap levels are further away from the intrinsic Fermi level 
the cosh function in eq. (9.17) behaves like exp|— (Ef — E+) /(kgT )], and 
thus it is common to replace the band gap energy in eq. (9.20) with 


Teak = O V ev. (9.19) 


E 
Deak ~ AVT? exp (- £ (9.20) 


2kg 


Eett = Eg +2A, 


where A ~ Ef — E 
The band gap energy Eg is itself a function of the temperature and can 
be parameterised as 
Es(T)=2o— or. 


For silicon, Ey = 1.206 eV and a = 2.73 x 1074 eV/K. For the temper- 
ature dependence of the leakage current this means that the temperature 
dependence can be described by 


Ett 2 Ta 
= T — — 
a VEER oe) 
with Eet = 1.21 eV [191]. 


It is common to quote the leakage current at a reference temperature 
(often 0 °C). It can then be scaled to the appropriate temperature using 
eq. (9.21). As a rule of thumb, the leakage current doubles for an increase 
in the temperature of about 7 °C. 

The leakage current affects a silicon detector system in several ways: 


Teak « VT? exp (- (9.21) 


e The readout electronics needs to accommodate the constant leakage 
current. For strip detectors the readout is typically capacitively cou- 
pled, so that the DC component of this current is disconnected from 
the readout electronics; 


e Statistical fluctuations due to the discrete nature of the moving 
charge carriers lead to shot noise, which scales with \/[ieak (see sec- 
tion 3.2); 


e The leakage current and the bias voltage across the pn junction re- 
sult in ohmic heating of the silicon. This heat needs to be removed 
by some means of cooling. Otherwise the temperature of the junc- 
tion will rise, and thus the leakage current will increase even further 
and the detector will become thermally unstable and go into ‘ther- 
mal runaway’. 
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It is common to write leak = enV /Tg, 
where T, is defined as the generation 
time constant. This is the exponential 
decay constant that describes the return 
to equilibrium ny = np = ni. 


We neglect any variation of 6 with v 
and the temperature dependence of the 
effective carrier masses. 


A is a proportionality constant. 


Ta = Eeg/(2kg) =~ 7000 K is called the 
‘activation temperature’. 


Practically, thermal runaway is limited 
by the ability of the power system to 
supply the current to maintain the 

bias voltage at the required sensor 
temperature. 
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The strong variation of leakage current with temperature implies that 
for irradiated detectors we must limit the effects of leakage current to a 
manageable level by operating the silicon detectors cold. 


9.6 THERMAL MANAGEMENT OF SILICON DETECTORS 


Because of the low mass in silicon tracking detector systems their heat 
capacity is small. At the same time, they typically include significant 
heat sources from on-detector front-end electronics and the temperature- 
dependent sensor leakage. The thermal management of silicon detector 
systems therefore has to achieve three requirements: 


e Remove the heat generated by the front-end electronics and sensor 
leakage; 


e Maintain sufficiently low temperatures to limit reverse annealing 
effects (see section 9.11), reduce shot noise due to sensor leakage 
current, and provide the required temperature for the front-end elec- 
tronics components; 


e Provide enough heat removal capacity at sufficiently low tempera- 
ture, so that thermal stability is achieved. 


These requirements are usually achieved by means of a local heat sink 
(typically a coolant) at an appropriate temperature, and a highly conduc- 
tive heat path from the heat sources to the sink. 


COOLING SYSTEMS 


The typical power in silicon systems leaves only forced convection as a 
practical means to remove the heat from the silicon detector system over 
the large distance to an off-detector heat sink. Coolants can be in a single 
phase, or they can absorb the heat in a phase change. Any coolant must be 
chemically inert. 

Monophase coolants can be liquid or gaseous. In the latter case the 
specific volumetric heat capacity is lower, limiting its use to systems with 
low front-end power, but the lower density is attractive if a low mass of 
the silicon system is one of the driving requirements. The heat transfer 
from the coolant to stationary parts of the heat path (e.g. the cooling tube 
wall) is higher for turbulent flow (Re = 2400). In monophase cooling sys- 
tems the temperature of the coolant increases along the contact to the heat 
source. 

Phase-changing cooling is attractive for the large latent heat typically 
associated with the phase transition. The most common type of phase- 
changing cooling in silicon detector systems are evaporative systems. 
Evaporative cooling systems can be pump-driven, compressor-driven or 
gravity-driven. In pump-driven systems (see for example [484]), the chal- 
lenge is the subcooling required to prevent the pump from cavitating. In 
compressor-driven systems (see for example [105]) it is the need for oil- 
free compressors with a high compression ratio, as lubrication with oil 
risks contamination of the thin on-detector cooling pipes. Gravity-driven 
systems (see for example [118]) need a considerable height difference (or- 
der 100 m) to achieve the required drive pressure for typical coolants. 
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Thermal stability means margin against 
thermal runaway. 


Not only is the density of the gas lower 
than for a liquid, but often gas-cooled 

systems do not require a container for 

the flow. 


The Reynolds number Re is defined as 
Re = GD/ with G the mass velocity (in 
kg/m?s), D the diameter of the flow, and 
H the dynamic viscosity of the coolant 
(in Pa). 


Cavitation is evaporation caused by 
local pressure drops close to the fast 
moving parts of the pump, which makes 
it impossible for the pump to gener- 

ate pressure. The fluid needs to be 
cooled sufficiently below saturation 
(subcooled) to prevent this. 
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Figure 9.14: Conceptual pressure enthalpy diagrams for different cooling cycles [489]. a) compressor-driven. b) gravity-driven. c) pump-driven. 

The thin lines are isotherms for the coolant, and the bump is the saturation curve, separating, from low to high enthalpy, the liquid, two-phase and 
gaseous phase space, respectively. The loop direction is counter-clockwise in a) and clockwise in b) and c). The pressure control points in the three 
cycles are indicated by the thick arrows. The thin arrows indicate heat transfer via internal heat exchangers (HEXs), which can be used to increase 


enthalpy available for cooling (in a) and b)), or guarantee saturated conditions in the evaporator (in c)). 


Pressure-enthalpy (or p-h) diagrams are useful to describe loop systems as the enthalpy (h = u + pv, where u is the internal energy) of the fluid stays 
constant in a thermally insulated flow process, and pressure drops are caused by flow restrictions (for example the pressure drop due to flow in a 
pipe). Exchange of heat with the environment changes the enthalpy of the fluid by the amount of heat exchanged. 


In evaporative systems the evaporation temperature on the detector 
is controlled by the pressure in the return line on the outside of the de- 
tectors, either by a backpressure regulator or by a temperature-controlled 
accumulator in the cooling plant. In either case, the on-detector evapora- 
tion pressure is offset from the set value by the temperature drop corre- 
sponding to the pressure drop in the return lines from the detector to the 
pressure-setting element. Great care has therefore to be taken to reduce 
this pressure drop. 

In evaporative systems the heat transfer depends on the physical prop- 
erties of the coolant, but also on the geometry of the flow (the ‘flow pat- 
tern’), which changes as the coolant absorbs heat. The best heat transfer is 
achieved for a thin liquid film along the tube wall, with a vapour core (‘an- 
nular flow’). Once a significant fraction of the fluid has evaporated (typi- 
cally 50% to 80%), the liquid film will lift from the wall, resulting in mist 
flow. At this point the heat transfer characteristics deteriorate significantly, 
which usually constitutes the limit of operation for evaporative cooling 
systems. 

The physics of bi-phase flow is complex and no complete theory ex- 
ists. Computational fluid dynamics methods are hampered by the diffi- 
culty to model the interfaces between the phases, and thus a wide range of 
semi-empirical correlations have been developed, with limited prediction 
power [284], some of which have been included into software to calculate 
pressure drops and heat transfer properties in evaporative silicon detector 
cooling systems (for example CoBra [488]). 


The currently most commonly used 
evaporative cooling system design is the 
pump-driven, accumulator-controlled 
2PACL (2-phase accumulator-controlled 
loop) design with CO as cooling 

fluid [484]. The accumulator is a 
storage tank for 2-phase coolant in 
saturation, in which the pressure is set 
by controlling the temperature. 


The feature of the line that most strongly 
affects the pressure drop is the inner 
diameter of the line, as well as its 
length, and to a lesser extend bends, 
abrupt changes in diameter, and surface 
quality. 


This happens before complete evapora- 
tion (‘dry out’) of the liquid. 


Because of the limitations of the corre- 
lations used it is mandatory to perform 
representative prototype tests to improve 
these predictions. 
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CONDUCTIVE PATH 


Generally, heat conduction is a three-dimensional problem, but for bench- 
marking purposes the performance of the conductive thermal path can be 
characterised by a scalar thermal impedance. It includes the heat transfer 
at interfaces (including the interface from the coolant to stationary parts of 
the heat path). 

The thermal impedance of a simple heat-conducting element can be 
described by R =1/(KA), where k is the heat conductivity and / and A ge- 
ometrical factors (the length and cross-section of the heat path). Low ther- 
mal impedance can therefore be achieved by materials with high thermal 
conductivity, and geometries with short heat path and large cross-section 
and contact areas. 

A whole industry exists for the development of high-thermal- 
conductivity materials. In general, higher density materials will display 
better heat conductivity, but one class of materials that has high thermal 
conductivity, moderate density and excellent structural properties are car- 
bonised materials (carbon fibre, but also carbonised foams). A special 
class of materials are thermal interface materials (TIMs), which are typ- 
ically viscous substances used to improve the heat conductivity between 
two contacting solid surfaces by filling in microscopic features on the sur- 
faces, which otherwise would result in a small insulating gap. 


10000 » 


Low-density foams 
1000 “carbonized solids 


Carbon foams 
through-plane ~ Carbonized solids 
€T SÇ T High In-plane 
£ ceramics 
iad / = 


100 | —/ Silicon 
| Solid metals 
| 
JE 
10 | ; 
0.1 1 10 100 1000 10000 
Heat conductivity [W/Km] 


Geometrically, the thermal impedance can be reduced by bringing the 
local heat sink as close as possible to the heat sources, and in particular 
the sensors. One approach used in modern pixel detector systems uses 
micro-channels, which are small-diameter cooling channels in direct con- 
tact with the sensor. These can be etched channels in a silicon substrate, 
which has the benefit of matching thermal expansion to the sensor. The 
challenge here is the number of connections required due to the limited 
size of the substrates, and the need to achieve leak-tight fluid connections 
at the required pressure to the silicon material. 
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Sometimes a ‘thermal figure of merit’ 
(TFOM) is used, which is the inverse 
of the thermal impedance. Attempts to 
normalise the TFOM by multiplying 
with the sensor area are usually not im- 
proving understanding, as this property 
does generally not scale simply by area. 


For a slightly more detailed discussion 
see [489]. 


Figure 9.15: Radiation lengths versus 
heat conductivity for common groups of 
materials in silicon detectors [489]. 


See for example [220]. 
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SENSOR TEMPERATURE AND THERMAL STABILITY 


While the temperatures in a silicon detector system are generally de- 
termined by a three-dimensional heat flow, it is instructive to model the 
thermal behaviour using a simple linear network model using thermal 
impedances and heat sources. Temperatures then correspond to the poten- 
tials in the system. We will demonstrate this here for a system of a sensor 
(generating a thermal power Q) and associated front-end electronics gen- 
erating Qn. The first benefit of the linear network model is that the stan- 
dard network rules (Kirchhoff’s rules, Thevenin equivalents etc.) apply. 

The system is thermally stable if the heat removal capability is suffi- 
cient to remove the sensor heat. Mathematically, this is achieved if a solu- 
tion for the network equation 


(9.22) 


can be found, where Q(T7,) is the sensor leakage power at the sensor tem- 
perature 7;, and R; is defined as in figure 9.16. 


To= -25C 
R,= 3.109 C/W 
Rc = 2.164 C/W 
Q, = 6W 

To2 126 


Operating 
Point 


Ts [°C] 


As will be discussed in section 9.11, one of the consequences of ra- 
diation damage in silicon detectors is an increase in leakage current. It is 
therefore vital for the design of the thermal management in a high-rate en- 
vironment to understand the thermal stability of the system as the leakage 
current increases over the lifetime of the experiment. 

A relation between the leakage power at the reference temperature Qref 
and the sensor temperature T, can be found using eq. (9.21), 


a) Paz) 
—sa e ex A Een . 
Ri T; 3 T; Tief 


It is standard practice to display this curve in its inverted form, 
T; = T;(Qref) (even though it cannot be inverted analytically). 

The condition described in eq. (9.22) can be used to find an expression 
for the critical sensor temperature, 


Qref = (9.23) 


i 


Ty crit > To T T` (9.24) 
A 


Using the scaling law eq. (9.21) this gives the critical leakage power, 


TaT ot | ( l : )| 
e= ——_eh_exp | T, TF. Rt. 
Oref crit cn ry Ty)? Pita Tscrit Tree / 
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Figure 9.16: Linear thermal network for 
a simple silicon detector system (left) 
and its Thevenin equivalent (right). Rs: 
thermal impedance for sensor, R.: com- 
mon thermal impedance, R; = R; + Re: 
total thermal impedance, T.: coolant 
temperature, Ty: sensor temperature. 


To = T, + ReQn. 


Figure 9.17: The balance of dissipated 
and conducted heat. The solid curves 
represent sensor heats of Q(0°C) = 

2, 4 and 8 W. The dashed line repre- 
sents conduction by the thermal path. 
Numerical values are appropriate to 
the ATLAS SCT barrel module [121]. 


For Q(0°C) = 8 W (the topmost curve) 
no solution to eq. (9.22) exists, the sys- 
tem is thermally unstable and thermally 
runs away. 


This is often wrongly called the ‘ther- 
mal runaway curve’. In reality this is 
the stable sensor temperature curve. 
Runaway only occurs at the endpoint 
of this curve (where the slope becomes 
singular). 


‘Critical’ here describes the value 
of a parameter at which the thermal 
behaviour becomes unstable. 


See exercise 9. 
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at r as a function of power density for a 

25 a Pii ae Analysis prototype silicon detector module for 

20 T Measuremen the ATLAS Semiconductor Tracker 


endcaps [194]. 
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To find the critical parameters in terms of a given leakage power we 
start with the coolant sink temperature to achieve a sensor temperature 7; 


Ty i 4 
To =T,—R R 5 T ea 2 
e = T; — ReQn — RiQh (=) exp | A (= 3) (9.25) 


At the critical point (onset of runaway) (d7,/dT7;)|7,= 
for the critical sensor temperature 


= 0. This yields 


Ts crit 


Teef 
2 š 
1— Te in (ee, 
Ta RQref TA 


with Ti = Ta + 2T, ~ 7500 K. Inserting this in eq. (9.25) gives for the 
critical cooling sink temperature 


Ts crit > 


(9.26) See exercise 9. 


T. : 
Te crit = Ts crit (1 = Be) —R.Qy. 
A 


Figure 9.19 summarises all the results in this section in one figure us- 
ing the example of the ATLAS SCT module. 


10 4 Figure 9.19: Sensor isotherms (full 
35 -15°C -10°C lines) and lines of equal strip current 
54 30 -5°C oc (dashed lines) for the ATLAS barrel 
— 5°C — 10°C SCT module from the minimal model 
0 25 — 5C 29°C (R; = 0.945 K/W, Re = 2.164 K/W, 
— 25°C 35°C Qn = 6 W, sensor area 163 cm?, strip 
=5 =---0,5uA  ----1pA area 10.7 mim, Vpias = 460 V) as a 
5 =--=-2uA =--=-3uA function of leakage power density and 
Os -10 Tccrit coolant temperature. The area above the 
z red line is excluded by thermal runaway. 
Crosses with numbers indicate sensor 
temperatures on the boundary line 
[121]. 
0 50 100 150 200 
aret at 273K [WW/mm?] 
While we have discussed a specific linear thermal network in this sec- 
tion, the model is relatively generic and can be used for many thermal 
designs, and the conclusions can be used generally. It is strai ghtforward A 3D solution using FEA is not possible 
to extend these concepts to create more complicated networks, which for such problems anymore, and the 


can also include other non-linear power components. At this point, the ENR model isthe anireriainn 
calculation tool. 
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model does not necessarily have analytic solutions anymore, but can still 
be solved numerically with little effort (see for example [123]). 

The numerical values for the thermal impedances in the model can be 
obtained from FEA, or from measurements of the sensor temperature for 
known leakage power configurations. For the best estimate of the thermal 
impedances in the network model the average of the temperatures over the 
silicon sensor should be used [123]. 

The linear network equivalent can also be used to estimate the dy- 
namic response of the thermal design. The time constant for a response 
can be estimated from t = RC, where R is the thermal impedance and C 
the heat capacity of the silicon module [121]. 


9.7 PIXEL DETECTORS 


For many experiments at accelerators, to cope with the high track densi- 
ties close to the interaction vertices and for improved resolution of their 
spatial reconstruction, smaller size detector elements are needed. Pixel 
sensors have a typical element size of 50 to a few 100 um per side. The 
pixel cell is based on the same type of reverse biased junction as used for 
strip detectors. However, there are many additional technical challenges 
that have to be solved for pixel detectors, in particular for high-rate and 
high-radiation applications like at LHC. 

Perhaps the most obvious challenge is how to connect the individ- 
ual pixels to the cells of the readout electronic ASICs. As the pixel cells 
cover the full area of the sensor, they cannot be connected using the wedge 
bonding technique used by strip detectors. There is a well established 
industrial process for higher density interconnects than is possible with 
wedge bonding called ‘Controlled Collapse Chip Connection’ (C4) or flip- 
chip bonding. This process allows chips to be connected to Printed Circuit 
Boards (PCBs) or other chips. The steps required at the wafer level before 
dicing into individual chips are [435]: 


e Under Bump Metallisation (UBM): A sequence of metal layers is 
deposited through a mask on to the implant pads to ensure good 
adhesion, and to provide a solder barrier and a very thin gold layer 
to prevent oxidation; 


e Solder bumps: Solder is deposited on the UBM through a mask with 
typical dimensions of 100 um x 100 um; 


e Reflow: The wafer is heated to melt the solder and the surface ten- 
sion of the liquid solder results in approximately spherical shaped 
solder balls; 


e Dicing: The individual chips are then cut out of the wafer. 


After similar solder bumps are created on the mating chip, one chip is 
‘flipped’ so that the solder balls line up (hence the alternative name flip- 
chip) accurately and then heat is applied for a second reflow process. The 
solder balls melt and surface tension pulls them into accurate alignment. 
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In both cases only as many FEA 
runs/measurements are needed as there 
are thermal impedances in the network. 


Smaller pixel sizes can be achieved with 
CMOS sensors (see section 9.10). 


A aa 
Figure 9.20: PbSn solder bumps on 


ATLAS read out circuit (25 um size, 
50 um pitch) [503]. 


The solder is lead-rich PbSn for the C4 
process but eutectic PbSn (i.e. it melts 
at a lower temperature than either of its 
constituents) or Indium can be used for 
detectors. 
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The problem with the standard industrial process is that the minimum 
pitch is limited by the mask to about 250 um, too coarse for the require- 
ments of high resolution pixel detectors. This limitation has been over- 
come with the use of novel masks which allow the deposition of 25 um 
diameter solder balls and thus allowing for a pitch as small as 50 um. The 
two wafers need to be very accurately aligned and held very accurately 
parallel during this process to ensure a high yield. Failures that can occur 
are short circuits between neighbouring solder bumps and open circuits 
where the solder connection between two bumps failed. 

Bump bonding is used to connect the silicon pixel sensor to the corre- 


sponding ASIC in the first step of the fabrication of a silicon pixel module. 


To cope with the effects of radiation damage in high-radiation envi- 
ronments, pixel detectors need to be operated cold, which creates addi- 
tional stress from any mismatch in the coefficient of thermal expansion 
(CTE) between the bonded parts and can lead to solder connection fail- 
ures. This makes the process very technically challenging and the cost of 
bump bonding is a significant fraction of the total cost of a pixel module. 


bump bonds me , 
silicon pixel sensor 


N 
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Readout chip Readout chip 


As discussed in section 9.1, pixel detectors for high radiation environ- 
ments use structures that have electrons drifting towards the pad implants, 
rather than holes, which improves the speed and therefore reduces the 
losses from charge trapping. This is achieved with n* implants in n*-in-p 
or n*-in-n detectors. For pt-in-n sensors, after radiation-induced type 
inversion (see section 9.11), the naive expectation is that the diode junc- 
tion will grow from the backside. As most of the induced current comes 
from the motion of the electrons near the collection electrodes this would 
result in very small signals for sensors that are not fully depleted. This 
simple picture predicts that for n*-in-n sensors, after radiation damage the 
junction would grow from the collection electrode side resulting in useful 


signals even for detectors that are not fully depleted. Therefore n*-in-n 


sensors were used for pixel detectors in the first generation LHC detectors. 


The negative HV bias required for the n* implants is applied to the back- 

plane of the detector so the resulting electric field accelerates the electrons 
towards the collection electrodes. As discussed in section 9.1, appropriate 
guard ring structures are required. 
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The biasing technique described for strip detectors using polysilicon 
resistors would use too much area for a pixel detector. The issue is how 
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Such a pixel detector is called a ‘hy- 
brid’ pixel detector. 


Figure 9.21: Schematics of a hybrid 
pixel detector with bump bonding. In 
a pixel module the readout chips will 
be connected to a flexible PCB by wire 
bonding. 


Near the collection electrodes the 
weighting and drift fields are high. 


Further studies have revealed a more 
complex picture, in which the electric 
field peaks at the junction and backside 
of heavily irradiated sensors [233]. 


Figure 9.22: Schematic cross-sections 
of a p*-in-n pixel module (left) com- 
pared with an n* -in-n pixel module 
(right). This does not show the biasing 
structure or the inter-pixel isolation 
technique [435]. 
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to connect the very large number of pixel sensor elements to a common 
voltage level. This can be done using the virtual earth of the preamplifier 
when the sensor is bonded to the readout ASIC. 

It is essential to be able to test components like sensors and ASICs 
before bump bonding. To connect all the pixels, a “‘punch-through’ bias- 
ing technique is used. This technique provides a connection from a bias 
rail to individual pixels. As the bias voltage is increased, the depletion re- 
gion around the bias rail increases until it connects to the implant region 
for the pixel. Thermionic emission allows current to flow from the bias 
rail to the pixel, thus establishing the required bias voltage on the pixel. 
Using this technique we can measure the current as a function of the bias 
voltage for unbonded sensors and reject failed sensors before further inte- 
gration [316]. 

The key advantage of a pixel detector compared to strips apart from 
the obvious smaller size of the active element size is that the much smaller 
area of one pixel compared to a strip results in a lower capacitance and 
therefore lower noise (see eq. (3.11)). Another advantage of the smaller 
cell size is that leakage currents per channel are much smaller. Hence the 
shot noise contribution (eq. (3.4)) is also much reduced. In addition, the 
leakage currents are sufficiently low that it is not necessary to AC-couple 
the readout. The much lower noise implies that pixel sensors can operate 
successfully even after radiation damage has reduced the magnitude of the 
signal (see section 9.11). 

The disadvantage of pixel detectors compared to strip detectors is that 
they will present more material per layer than strip detectors because of 
the bump bonds and the fact that the size of the readout ASICs is as large 
as that of the sensors. Pixel sensors also have a considerably higher cost 
per area. These advantages and disadvantages are some of the key factors 
that drive the choice that in present collider detectors pixel detectors are 
placed close to the beam, and strip detectors at larger radii. 

The simple pipeline concept used for the readout of strip detectors 
would use up too much area to be practical for a pixel chip with a much 
larger channel count. However, we still need a way of storing pixel data 
during the latency period required by the trigger logic. One architecture 
transfers hit pixel address and time stamps to register the hits. When a 
trigger is received the hits with the correct time stamp are read out [2]. 

As the hit occupancy is much lower than for a strip detector, in a pixel 
system the signals from the preamplifiers can be slower. In that case in 
high intensity colliders the signal will in general be above threshold for 
several clock cycles. The number of clock cycles is a measure of the ‘time 
over threshold’ (TOT) and the TOT can be used as proxy for the charge 
deposited in the particular pixel. 


SILICON DRIFT AND DEPFET DETECTORS 


The strength of pixel detectors is also their biggest challenge: a very high 
channel density, requiring a large number of readout channels and con- 
nections between the sense elements and the electronics. For applications 
where high position resolution is needed, but at lower rate, alternative de- 
signs have been developed, where the charge carriers created by the in- 
coming particle are transported within the silicon towards the edge of the 
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A virtual earth (sometimes called a 
virtual ground) is effectively held at 
ground without being directly con- 
nected. For example, consider an op- 
amp in a circuit using negative feedback 
with the non-inverting input connected 
to earth. The negative feedback will 
result in the inverting input being held 
at a voltage very close to earth even 
though the impedance between the two 
inputs will be very high, hence the name 
virtual earth. 


The typical failure mode for silicon 
detectors is a rapid increase in detector 
leakage current at voltages below that 
required for full depletion. 


Therefore one can use thinner detectors 
which produce a smaller signal, but 
have higher electric fields and faster 
signals as well as reduced material, 
which reduces multiple scattering. 


The amplifiers then need leakage cur- 
rent compensation to prevent saturation 
and slow drifts as the leakage current 
changes. 


This is called ‘column drain’. 


This is used to obtain higher precision 
centre of gravity measurements of pixel 
clusters than could be achieved with a 
simple binary approach. In addition, 
this allows for particle identification 
using the dE/dx method (see chap- 

ter 12.3) [401]. Note that this is only 
useful for PID at low momenta (below 
minimum-ionising) due to the density 
effect. 
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sensor or towards a single collection point, thus reducing the spatial di- 
mensions of the readout by one. This information can be recovered from 
the arrival time of the signals. 

If two planar pn junctions are placed back-to-back (with the p* layers 
on the outside) and connected as usual to a bias voltage through an n” 
contact, but this time at the edge of the silicon, the potential will form a 
potential trench across the thickness of the silicon when the bulk is fully 
depleted. Electrons liberated in the bulk will be contained by this potential 
and move towards the bias connection, drifting in the plane of the sensor 
to the edge. This is the principle of the silicon drift detector (SDD) [257]. 

Practical silicon drift devices are made of a number of p* strips on 
both faces of the sensor at appropriate potentials to create the drift field, 
and a single anode n* strip for the readout. To obtain position information 
orthogonal to the drift direction the anode can be divided into pads, and 
the second coordinate obtained from the sharing of the charge induced in 
the pads. 

A feature of silicon drift detectors is that the detector capacitance 
drops abruptly when the sensor becomes fully depleted. While in the un- 
depleted case the capacitance is formed over the whole sensor area, in the 
depleted case it is given only by the capacitance from the anode to the ad- 
jacent electrodes, and thus is much smaller and becomes independent of 
the area. To further reduce the input capacitance to the readout amplifier, 
the first transistor can be implemented as a FET on the detector with the 
sensor anode providing the gate. The small capacitance results in a very 
fast signal and low noise. 


| ee ee 
Cn’ pr 


xX OTHER Al CONTACTS FLOATING 
© OTHER Al CONTACTS AT THE SAME POTENTIAL. 4 


00 
"o f~ ] 


Hw 
| oe 


UNDEPLETED SILICON 
(CONDUCTOR) 


mis oes 
DEPLETION : 
REGION 


n* CONTACT 


o 
BELOW DEPLETION 


o 
AT DEPLETION 


Lsa 


o rae S aw. wee a ee | 
ABOVE DEPLETION ena Ea eo Ee 
Up*,n? (V) 


In the SDD the cathode strips shield the anode from the drifting elec- 
trons, so that they only induce a signal on the anode when they are very 
close, and, like for gaseous drift chambers, the position can be inferred 
from the time it took the electrons to drift from the point of ionisation to 
the anode. Like in the case of gaseous drift chamber, calibration of the 
drift velocity is important, and can be accomplished by injecting charges 
at known locations. 

The position resolution achieved in SDD position detectors is a few 
tens of um. The challenge is that the material must be very pure to prevent 
trapping of the electrons during the long drift. Because of the low density 
of readout channels, and to allow for a charge sharing measurement, the 
readout is typically analogue. 
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This is referred to as ‘sideward deple- 
tion’. 


Another type of detector with lateral 
charge carrier movement is the charge 
coupled device (CCD), where the in- 
dividual pixels are made of CMOS 
capacitors. In the readout, the charge 
accumulated in each capacitor from 
the passage of ionising radiation is 
passed on to the next capacitor in a 
row controlled by a clock signal. The 
charge is thus moved towards the edge, 
where it gets amplified and digitised. 
This is done until the charge from all 
capacitors has been collected. For a 
long time CCDs have been the most 
important image sensor technology, and 
the have been used in the SLD vertex 
detector [363], but they are now mainly 
replaced by active CMOS sensors. 


Figure 9.23: Depletion at different 
stages (left) and capacitance as a func- 
tion of depletion voltage (right) for an 
SDD [257]. Shown is a bias connection 
at the edge. In practical detectors the 
p* electrodes are segmented into strips 
and the n* electrode is also placed on 
one of the facings of the sensor, but the 
basic principle remains. 


The additional benefit of analogue 
readout is that the result can also be 
used in a measurement of the energy 
loss dE/dx. 
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Silicon drift detectors are attractive in energy-dispersive spectrometer 
(EDS) X-ray spectroscopy because they are fast and low-noise, achieve 
a good energy resolution at room temperature, and, as the devices are 
fully depleted and all the device thickness is sensitive for the absorption of 
X-rays, they have a high efficiency (for a 300 um thick sensor the detec- 
tion efficiency for 10 keV X-rays is 90%). 

Such detectors are typically circular with a small central anode pad, 
again to minimise the detector capacitance [325]. The energy resolution 
and noise can be further improved by cooling the devices to a few tens of 
degrees below 0 °C, which can be conveniently achieved by Peltier cool- 
ers. 
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The combination of a fully depleted bulk with a FET results in a sen- 
sor that produces a current signal without the movement of charge carriers, 
but by a static field effect [303]. Like in the silicon drift detector the ioni- 
sation charges are captured by the potential in a fully depleted n-type bulk 
but this time underneath a p-type FET, where they control the current be- 
tween the drain and source terminals of the FET (they provide an ‘internal 
gate’ function). 
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pt Bias 


The benefit of this is that the signal charge can be much lower, and 
thus the sensor thinner (about 50 um) than in a conventional charge- 
collecting pixel sensor. As the current signal in a DEPFET is not created 
by the movement of charges, but just by their presence at the internal gate, 
the measurement is non-destructive and the device can store the charge. 
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Figure 9.24: ALICE Silicon Drift 
Detector layout with a zoom of the top 
right corner [240]. 


The EDS technique works by using a 
electron beam to excite electrons from 
inner shells. The electrons from higher 
energy levels will fill these holes and 
the energy of the resulting fluorescent 
X-rays can be used to identify individual 
chemical elements. 


Figure 9.25: Cross-section of a cylin- 
drical silicon drift detector for X-ray 
spectroscopy with integrated n-channel 
JFET. The gate of the transistor (G) 

is connected to the collecting anode. 
The radiation entrance window for the 
ionising radiation is the non-structured 
backside of the device [214]. 


Such a device is called a depleted FET 
(DEPFET). 


Figure 9.26: Schematics of a DEFPET 
detector (left) and potential within the 
bulk underneath the gate contact (right) 
[304]. 


The pixel detector of the Belle experi- 
ment [492] is using DEPFET detectors 
that are 75 um thick in the active re- 
gions. 
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Only when a suitably high voltage is applied to a ‘clear’ contact (an n” 
electrode) the charges are drained away. 

The charge storing capacity of the DEPFET and the non-destructive 
measurement allows for dual-FET devices, in which the signal charge can 
be repeatedly transferred between the two FETs, allowing for a statisti- 
cally independent repetition of the charge measurement and thus a reduc- 
tion of the noise to sub-electron level. This allows the resolution of the 
signal from single photons [351]. 
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9.8 PHOTON DETECTION WITH SEMICONDUCTORS 


The small band gap in semiconductor detectors makes them attractive for 
the detection of photons. The pn diodes we considered for charged particle 
detectors can also be used as photon detectors. 

Improved photodiodes use a p-i-n structure, in which there is a thin 
intrinsic (i) layer between heavily p and n-doped regions. The i layer can 
be fully depleted with a low bias voltage because of the low carrier con- 
centration and the small thickness of the i layer. The increased depletion 
thickness increases the sensitive volume for photon detection. If a photon 
is absorbed in the i layer it will create an electron/hole pair. The electron 
and the hole will drift in the high field of the depleted region. The high 
drift velocities combined with the small thickness of the i layer will result 
in a fast signal. 

The photon absorption coefficient for intrinsic silicon as a function of 
wavelength is shown in Figure 9.28. 


Absorption coefficient [em] 
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The band gap in silicon is 1.12 eV, which would correspond to a wave- 
length of ~1100 nm. However, as silicon is an indirect bandgap semi- 
conductor, the energy required for a transition without any change in mo- 
mentum is ~3.4 eV. At intermediate energy (i.e. in the range between 
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These are called repetitive non- 
destructive readout DEPFETs (RNDR- 
DEPFETs). 


Figure 9.27: Single photon spectrum 
measured at low light intensity from 
an optical laser (A = 672 nm) with 
a circular RNDR-DEPFET at a tem- 
perature of 55 °C, obtained with 300 
readouts [351]. 


When silicon detectors are used for 
detecting charged particles they need to 
be in the dark to avoid large unwanted 
currents. 


The intensity of light penetrating a 
distance x is given by I = Inexp(—ax), 
where œ is the absorption coefficient. 


Figure 9.28: Attenuation constant 
for intrinsic silicon as a function of 
wavelength. Data from [269]. 
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1.12 and 3.4 eV), transitions from the valence to the conduction band re- 
quire phonons as well as the photon. This explains qualitatively why the 
attenuation constant is decreasing rapidly starting with wavelengths of 
~400 nm. We can see that a very thin intrinsic layer can give a high prob- 
ability for absorbing a photon for visible light and the near infrared. For 
longer wavelengths we need to use compound semiconductors like GaAs 
or InGaAs. 

The intrinsic layer can be chosen to be just thick enough for there to 
be a high probability of photon absorption for the target wavelength. This 
allows for the photodiode to be fully depleted at very low bias voltages. 
It also results in very short transit times, which allows for very high data 
rates. 

One critical figure of merit for a photodiode is the responsivity R. 
The photocurrent / for a photon power incident on the photodiode Py is 
I = RP}. The responsivity can be written in terms of the external quan- 
tum efficiency Nex and the photon frequency v as R = Nexe/ (hv). We can 
re-write this in terms of the reflectivity S of the surface and the internal 
quantum efficiency Nin 


[1 — S(A)] Nine 
hv f 


The responsivity can be increased by an anti-reflective coating on the sur- 
face. 

The other important parameter for a photodiode is speed. The intrinsic 
speed of a photodiode depends on the drift time and hence on the thick- 
ness of the active layer, the bias voltage and the carrier mobilities. How- 
ever, we also need to match the speed of the amplifier, which depends in- 
directly on the capacitance and hence the area and depth. The absorption 
length in InGaAs is shorter than in Si. Therefore, thinner active regions 
can be used. In addition, the electron mobility is larger in InGaAs than Si. 
Therefore the maximum drift time for InGaAs photodiodes will be less 
than that of Si photodiodes. This provides an additional reason that the 
fastest diodes used in high speed telecommunications are based on com- 
pounds like GaAs or InGaAs. 


9.9 SILICON DETECTOR APPLICATIONS WITH INTERNAL 
GAIN 


There are three main motivations for the use of internal amplification in 
semiconductor detectors. 


e Compensation for the reduced ionisation charge in thinner silicon 


sensors (about 50 um) to reduce material. This requires only moder- 


ate gain; 


e Improved timing performance to tag hits belonging to primary ver- 
tices close in time (for example within the same bunch crossing at 
LHC, see section 9.9); 


e Detection of photons in low-intensity electromagnetic radiation, in 
particular if single photons need to be detected. 
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For example for a wavelength of 750 nm 
anda 10 um thick layer, the probability 
of a photon being absorbed is 73%. 


GaAs and InGaAs are examples of di- 
rect bandgap semiconductors and there- 
fore have shorter attenuation lengths for 
energies just above the bandgap energy. 
These compound semiconductors are 
used for long distance telecommunica- 
tions that use a wavelength of 1310 nm 
or 1550 nm. 


The external quantum efficiency is 
defined as the probability of a photon 
hitting the device being converted to 
an electron-hole pair. The internal 
quantum efficiency is the probability 
of a photon entering the device being 
converted to an electron-hole pair. 


A typical value for a silicon photodiode 
operating at 850 nm is R = 0.5 A/W. 


For a given power, if there is an upper 
limit on the acceptable noise, this will 
place an upper limit on the bandwidth 
of the amplifier as for silicon particle 
sensors. 
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In chapter 4 we have introduced the multiplication of charge carriers 
due to impact ionisation. In semiconductors fields high enough to cause 
impact ionisation can be achieved by appropriate doping and bias voltage. 
In section 9.1 we have discussed the concept of silicon detectors based 
on highly doped implant regions in lightly doped bulk material, either 
p?-in-n~ or n*-in-p~ or nt-in-n7. If an additional doping layer of n in 
the former case or p in the latter is introduced in between, a region of high 
field is created at the p*/n or n™/p interface, respectively. This field can 
be high enough to cause impact ionisation and create more ionisation in 
avalanche multiplication. The strength and size of this accelerating field 
can be controlled by the doping concentration and the depth of the addi- 
tional doping, and the bias voltage. 


Charge multiplication p E v 


Primary charge 


Undepleted 


A range of device designs use this approach, but with different ranges 
of gain. Because the primary ionisation from charged particles in silicon 
sensors is already high, a small gain will cause a large signal. 


e Low gain avalanche diodes (LGADs) have a gain of around 10, 
which can be useful to boost the signal from a charged particle, 
allowing for the operation with thinner silicon sensors, or, for the 
same thickness, providing better time resolution (see next section). 


Inapt/n/n~ LGAD the secondary electrons only drift a short dis- 
tance to the collection electrode and they get quickly removed from 
the induced current signal. Their contribution to the integrated sig- 
nal is correspondingly very small. The secondary holes contribute 
most of the current and the integrated signal as they drift over the 
full depth of the detector. In order for the weighting field to be large 
over the full depth of the detector, the width of the collecting elec- 
trode needs to be comparable to the depth of the detector. 
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One of the big challenges for LGADs is radiation damage. Apart 
from the effects discussed for silicon detectors in chapter 9.11, the 
acceptor creation in the p* layer will decrease the net carrier con- 
centration and therefore lower the gain. 
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Figure 9.29: Schematic cross section 
of an n*/p/p~ avalanche photodiode 
(left) and doping profiles and fields 
(right) [350]. 


These type of sensors allow precision 
measurements in space as well as 

in time and this can be used for 4D 
tracking as discussed in section 10.2. 


This is more like the weighting field for 
a simple parallel plate capacitor, rather 
than a micro-strip. 


Figure 9.30: Simulated pulse from an 
LGAD [256], originally published 

in [184]. The curves labelled ‘electrons’ 
and ‘holes’ are due to the primary ioni- 
sation charges, whereas ‘gain electrons’ 
and ‘gain holes’ are the signals from 
charges that have been produced by 
impact ionisation. 
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e Avalanche photodiodes (APDs) have a gain of about 50 to 500. They 
are in wide use for the efficient detection of photons in, or close to, 
the visible region. In this ‘reach-through’ structure, photons enter 
from the n* side and have a high probability of penetrating into 
the depleted layer where they are absorbed. The resulting electrons 
drift towards the highly doped ‘metallurgical junction’ formed by 
the pt /n* doping. In this region the electric field is large enough 
to create impact ionisation and hence create a carrier gain M. For 
operation in the linear mode the peak electric field should be lower 
than the breakdown field. 


The avalanche process introduces additional fluctuations and the 
resulting shot noise is increased compared to a simple photodiode. 
This is accounted for by an excess noise factor f. An expression for 
the excess noise factor from a theoretical analysis is [362] 


1 
M?’ 


where k is the ratio of hole-to-electron ionisation rates. Silicon has 
a much lower k value (typically k < 0.1) than other semiconduc- 
tors. In addition, typical leakage currents for silicon photodiodes are 
much lower than for photodiodes made with other semiconductors. 
Therefore the lowest noise APDs use silicon (unless the wavelength 
A > 1100 nm, for which there is almost no absorption in silicon). 
For longer wavelengths other semiconductors like InGaAs are used. 
The excess noise in the simple p-i-n structures for these materials 

is avoided by the separation of the absorption and multiplication re- 
gions. These devices are called separate-absorption-multiplication 
(SAM) APDs [301]. 


If we had a noiseless amplifier, the noise from an APD would al- 
ways be worse than that from an un-amplified photodiode. However, 
for a real amplifier we can minimise the overall noise for low light 
intensities. For a shaping time T, leakage current Z, an equivalent 
noise resistance R for the amplifier and a total capacitance of C, the 
noise is given by (see eq. (3.16)) 


f =kM+(1—k) (2 (9.27) 


4kpT RC2 


2 
oj = 2elft+ Wt 


(9.28) 
Silicon APDs usually operate with M > 20, therefore the excess 
noise factor is given to a very good approximation by f ~2+kM, 
where k is defined in eq. (9.27). We can then determine the optimal 
multiplication by differentiating eq. (9.28), 


2\ 1/3 
m= (3S) 


9.29 
ekIt2 ( ) 


e Single photon avalanche diodes (SPADs) provide an even higher 
gain of 10° to 10°. They operate in breakdown mode, which means 
that a single charge carrier injected into the depletion layer can trig- 
ger a self-sustaining avalanche in the device. Here the feedback in- 
volves the creation of holes, which at the high field lead to further 
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For wavelengths à <~400 nm the pho- 
tons are typically absorbed before they 
reach the depleted region. Therefore 
APDs designed for shorter wavelengths 
use side illumination. 


We are ignoring any surface leakage 
current because it will not be amplified. 


Note that f in this expression is the 
excess noise factor defined above, and 
not the frequency. 


A typical value for M for a silicon APD 
is ~100. 


This can be a thermally produced 
charge carrier. 
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multiplication. The discharge is stopped by temporarily reducing the 
bias voltage (‘quenching’), either by a resistor or by other means. 
Proportionality of the output signal with the input is lost. 


Because a single electron, independent of its origin, can cause 
an avalanche in SPADs, they have a significant ‘dark rate’ that is 
caused by thermally generated electron-hole pairs. 


HIGH PRECISION TIMING DETECTORS 


The different types of silicon detector we have considered so far are typi- 
cally optimised to obtain high spatial resolution. Therefore, these systems 
were optimised to maximise the S/N at the cost of limited timing preci- 
sion (@(10 ns)). For LHC detectors, the time resolution achieved this way 
was sufficient to enable hits to be uniquely associated with a given bunch 
crossing (every 25 ns). For HL-LHC there will be up to 200 interactions 
per bunch crossing. Therefore, disentangling the vertices will be much 
more challenging. The collisions occur over a range in distance along the 
beam and also in time. Therefore, if we can achieve good timing precision 
we can greatly enhance the ability to separate the different vertices. 

For detectors not targeting timing measurements, the goal of the op- 
timisation was to maximise the S/N, but if we want to measure time pre- 
cisely we need a different optimisation. The precision of the timing mea- 
surement due to noise is 


(9.30) 


where V = V(r) is the output voltage as a function of time ft, and oy is the 
voltage noise. The ‘slew rate’ dV /dt œ 1/tr, where tr is the rise time of 
the preamplifier and t, ~ 1/Af where Af is the bandwidth. The voltage 
noise of the amplifier oy ~ ,/Af. Therefore we need a higher bandwidth 
to achieve a good timing resolution than for the optimisation of the S/N. 

However, there is no benefit in using a faster amplifier rise time than 
that of the signal in the sensor (ts) and the optimum is when t, = t; (see 
exercise 6). In order to further improve the timing precision it is necessary 
to increase the sensor signal, and sensors with internal signal amplification 
will be useful. An additional benefit of the amplification is that it allows 
for a reduction of the sensor thickness, which will reduce the length of the 
arrival time distribution, further improving the time resolution. 

Already the limited gain provided by an LGAD is sufficient, and a 
resolution of 34 ps has been achieved with a 50 um thick LGAD [185]. 

As usual for all precision timing measurements of analogue signals, 
time walk contributes to the timing resolution, and the use of a constant 
fraction discriminator is required for ultimate timing performance (see 
section 3.8). 

It is possible to achieve even better timing resolution using SPADs. 
However, the ‘dark rate’ for these type of detectors in a high-radiation 
environment is extremely high, limiting their use. 
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As this is similar to the operation of 

a gaseous Geiger counter, SPADs are 
sometimes referred to as operating in 
Geiger mode, although the feedback 
mechanism is not involving long range 
photons. 


This is called 4D tracking, see sec- 
tion 10.2. 


As discussed in section 3.2, the voltage 
noise power density dv? /df is constant. 


Alternatively, the timing information 
can be corrected for signal height 
variations after measuring the signal 
height using time-over-threshold (TOT), 
or sampling the signal (at the cost of an 
increase of data). 


A very large dark rate and a very large 
number of channels places challenging 
requirements for the readout bandwidth. 
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PHOTON DETECTORS WITH INTERNAL GAIN 


For the readout of particle detectors like scintillators we usually need to 
achieve low noise for low photon intensities. For low intensities (low pho- 
ton incidence rate), the signal becomes too small for detection at the de- 
sired data rate and power in the amplifier. The answer is again to employ 
an internal amplification process based on impact ionisation. 

We have already introduced APDs and SPADs as semiconductor de- 
tectors with internal amplification that are optimised for photon detec- 
tion. Silicon photomultipliers (SiPMs) are arrays of SPADs (up to 10000 
per mm”). Their typical gain is similar to conventional PMTs, but with the 
benefit of small pixel size and being unaffected by magnetic fields. 

The rise time of the pulse from a SiPM is determined by the avalanche 
process, so it is very fast (f; < 1 ns). 

There are two possible readout modes of a SiPM: 


1. Analogue. The output of all the quench resistors are ganged to- 
gether. Excellent timing resolution can be achieved [316] even for 
single photons (~100 ps). The speed of the falling edge is deter- 
mined by the RC time constant, where R is the value of the quench 
resistor and C is the pixel capacitance. Typical values for the time 
constant are in the range 30 to 100 ns [316]; 


2. Digital. The output of each pixel goes to a discriminator and the 
number of ‘fired’ pixels are counted by digital logic. The quench- 
ing is performed using MOS switches rather than passive resistors. 
Noisy pixels can be switched off. One disadvantage of this mode is 
the higher power consumption. 


The advantages of SiPM over PMTs are a higher quantum efficiency, 
a very fast rise time and that they can operate in a strong magnetic field, 
which is a benefit in large particle experiments with their spectrometers, 
but also in Positron Emission Tomography — Magnetic Resonance Imag- 
ing (PET-MRI). PMTs on the other hand have a lower dark count and can 
have a large area photocathode. UV detection is possible for PMTs with 
custom photocathodes. APDs are usually cheaper than PMTs. 

This comparison suggests that there are applications that benefit from 
a combination of these two technologies. An example for this is the Hy- 
brid PMT (HPMT). An HPMT has a conventional photocathode like in a 
PMT. Instead of having a chain of dynodes there is one stage of electro- 
static acceleration with a voltage of ~ 10 kV. The resulting electrons are 
detected in an APD. The high energy electrons create © (103) electron- 
hole pairs. The electrons create avalanches in the APD resulting in a fur- 
ther gain of @(107). Therefore HPDs can have very large gains like a 
PMT, but faster rise times which allow better timing resolution. As there is 
a much larger gain from the electrostatic stage, the pulse height resolution 
for single photons is better than for PMTs. 

Another type of hybrid photodetector is the hybrid photodiode (HPD), 
which uses a conventional photocathode and a very high electric field to 
accelerate the electrons towards a pixel pn diode. 

Photoelectrons emitted by the photocathode are accelerated by a po- 
tential of ~20 kV towards a silicon pixel pn diode. The electric field is 
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SiPMs are sometimes also called Multi 
Pixel Photon Counters (MPPCs). 


The resistivity of polysilicon is low, 
so that a long, thin track is needed 
to create a passive resistor, which is 
typically larger than a transistor, so 
that this mode has the advantage of 
requiring less silicon area. 
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also used to ‘demagnify’ and focus the photoelectrons on to the diode. 
This creates a signal of about 5000 electron-hole pairs in the silicon. The 
diode is read out by a custom ASIC which is bump-bonded to the diode. 
The very small pixel area corresponds to a very small capacitance and 
hence the readout speed can be high while the electronic noise can be very 
low. Therefore there is no need for the amplification of an APD and there 
is no excess noise factor. The resolution is sufficient to resolve the peaks 
from 1, 2, 3 and 4 photoelectrons [380]. 


9.10 MONOLITHIC ACTIVE PIXEL SENSORS 


Bump-bonding is a demanding operation, and obtaining high yields is 

a challenge. It is therefore attractive to integrate at least the first stages 

of the analogue readout into the pixel sensor. Such a detector is called a 
Monolithic Active Pixel Sensor (MAPS). Current approaches exploit the 
wide availability of CMOS technology for this approach. This technology 
is widely used to make CMOS imagers as used in mobile phone cameras. 
The same sensors could in principle be used as detectors for charged parti- 
cles but there are several disadvantages: 


e Imaging chips only require a thin sensitive region, due to the short 
mean free path for visible photons. The signal from minimum- 
ionising particles in thin sensors is small; 


e The demands for speed in imaging chips are usually not high, so 
that it suffices to rely on diffusion rather than drift. Particle detec- 
tors, in particular after irradiation, need to be faster; 


e Image sensors do not usually have 100% ‘fill-factor’, i.e. not all the 
area of each pixel is sensitive. 


For applications at low rates and low radiation damage the slow sig- 
nals from diffusion can be used. An epitaxial layer is used on a p-doped 
substrate which has a much higher conductivity. The signal is collected on 
an n-well with an n* terminal. Without a bias, there is only a small deple- 
tion region at the edge of the n-well, and most of the signal comes from 
electron diffusion in the epitaxial layer. The collection n-well can then be 
connected to CMOS transistors on the same wafer, which can be part of an 
amplifier or a buffer circuit. 
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Figure 9.31: Schematic cross-section of 
an HPD [28]. 


The photocathode area is larger than 
that of the diode. 


An epitaxial layer is grown on a sub- 
strate layer so that it is formed with 

one or more well-defined orientations 
with respect to the seed layer. Epitaxy 
results in improvement of the electrical 
characteristics of the epitaxial layer in a 
highly controlled manner. 
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The signal from the thin epitaxial layer is small, @(1000 e~), but a The difference between a DEPFET and 
good S/N can still be achieved because of the very small capacitance of ae a eee is that tn the former 
i sia . ; case the charge is trapped temporarily 
the collection electrode (as the noise is proportional to the capacitance, see in the depletion region and its field is 
: : : : ; : : controlling the drain-source current 
eq. (3.1 D). As the signal is from diffusion rather than drift, the signal is in the FET: whereas inthe taner ihe 
slow. Typical collection times are O( 100 ns) Ś charge is collected and then conducted 
Other transistors, which are part of the active electronics can be con- “° 4 80'e of a transistor: 
tained in the same chip as the sensor, but any n-wells need to be insulated 
by a p-well, so that the transistor is not affected by parasitically collected 
charges. By embedding NMOS (PMOS) transistors in p-wells (n-wells) 
we can use the full functionality of CMOS electronics. Therefore all func- 
tions of the readout ASIC can in principle be incorporated into the MAPS 
chip. 


Figure 9.32: Schematics of a MAPS 
detector [256]. 
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Another advantage of MAPS over hybrid pixel detectors is that they 
are much thinner as there is only the combined sensor/readout chip com- 
pared to the separate sensor and readout ASIC required for hybrid detec- 
tors. In addition, MAPS chips can be thinned as the bulk silicon is not 
used. This allows MAPS to be an order of magnitude thinner than hybrid 
pixel detectors. This approach has been used very successfully for vertex 
detectors for heavy ion collisions [210, 300]. 
However, diffusion-type MAPS cannot be used for high rate applica- 
tions like the LHC, where we need collection times of @(10 ns). In ad- 
dition the radiation tolerance of MAPS is very poor because of charge 
trapping and the long collection time. Therefore we need fast charge col- 
lection by drift caused by the field created in a depletion region, rather These type of MAPS are therefore called 
? 5 Depleted MAPS’ or DMAPS. 
than by diffusion. 


Figure 9.33: Schematics of a DMAPS 
detector [256]. 


50-100 um 
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Following eq. (9.4) the depletion depth at a pn junction scales as 


d œ V P Vbias , 


where p is the resistivity and Vpias the bias voltage. A large depletion pi eens pis pa eis H 
. . . . . - conductivity O, and the conductivity 
depth is thus achieved for high resistivity and bias voltages. in semiconductors © = e( Unn + 
Hence, there are two approaches for DMAPS: Hpnp), with the electron charge e, the 


mobilities for electrons and holes ln, p, 
á TS r woga k sys : , respectively, and the charge carrier 
e High resistivity (HR) MAPS. High resistivity silicon is used, which P Map: = 
allows a sufficiently large depleted region to be created at low volt- 


age; 


e High voltage (HV) MAPS. A special high voltage process, 


HVCMOS, is used to allow a bias voltage of (100 V) to be ap- The HVCMOS process is also required 
` . k ` . for the automotive industry and thus has 
plied, which generates a significantly deep depletion layer. become commercially relevant. 


The highest charge collection efficiency is achieved for devices with 
high resistivity and high bias voltage, and if a high fill factor can be 
achieved. 

In the DMAPS the drift field is generated by a deep n-well. The size 
of this electrode determines the fill factor of the sensor. The advantage of 
a small collection electrode is that it represents a small capacitance and 
therefore results in lower noise (eq. (3.11)). However, the drift distances 
are longer than for the large collection electrode, and therefore it will be 
more sensitive to charge trapping after radiation damage. A large collec- 
tion electrode will have higher capacitance and therefore higher noise than 
a small collection electrode. 


charge charge 
signal CMOS signal CMOS 
electronics electronics 
pt Si PW — EM pt n+ 
deep nwell | = 
e7 p - substrate p - substrate 


Figure 9.34: Schematics of different DMAPS geometries. Left: large collection electrode. Right: small collection electrode [256]. 


While there has been very significant progress in the design of radia- 
tion tolerant DMAPS, achieving the required performance for use at the 
LHC is very challenging. This is, however, a very active field, so more 
progress should be expected in the coming years. 


9.11 RADIATION DAMAGE IN SEMICONDUCTOR 
DETECTORS 


Radiation damage can have very serious consequences for the operation 
of semiconductor detectors. The same type of damage mechanisms affect 
all semiconductors, but we will again focus on silicon detectors as they are 
the most widely used detectors in high radiation environments. 

In semiconductors the main effect of radiation is due to bulk damage 
to the crystal lattice. Charged particles interact with the silicon nuclei by 
Coulomb scattering, and neutrons by the strong (or nuclear) force. If a 
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primary particle has sufficient energy, it can knock out a silicon atom from 
the lattice. This will result in a ‘vacancy’ in the lattice where the knocked- 
out atom came from. The knocked-out atom can become trapped in the 
lattice, which results in an interstitial state. The vacancy or interstitial state 
can also interact with impurities present in the lattice to produce more 
complex defects. 


Conduction band f 1 
Donor (+) ia 
Donor (-) ——— 
y 4 3 


Valence band 


Charged Generation- Trapping 
defect recombination centre 
centre 


If the energy of the knocked-out atom is sufficiently high, then this 
atom can knock out further silicon atoms from the lattice, resulting in 
large defect clusters. In general, these defects will result in the creation of 
new energy levels in the band gap between the valence and the conduction 
band. 

The impact of these new energy levels will depend critically on where 
they are in the gap. As we have seen in section 9.5, those defects near the 
centre of the band gap will contribute to an increase in leakage current. 
Energy levels near the conduction or valence band on the other hand will 
affect the effective doping concentration. Other energy levels in the band 
gap can result in trapping centres for mobile electrons or holes. This ef- 
fect reduces the distance free carriers can drift and therefore reduces the 
measured signal. 

Some of the defects can be mobile and therefore there will be a com- 
plex temperature dependent time evolution of the radiation effects. Va- 
cancies and interstitial defects can ‘annihilate’, resulting in ‘beneficial’ 
annealing. However, neutral defects can combine to create new charged 
states and this affects the net effective doping density with serious impli- 
cations for the long term operation of silicon detectors in high radiation 
environments. This is called ‘reverse annealing’. 

The bulk radiation damage is a function of the type and energy of the 
particle causing it. Detectors at hadron colliders are exposed to a compli- 
cated mixture of different particle types, each with different energy spec- 
tra. Particles lose energy in the sensor material by ionisation, but also by 
other, non-ionising, processes. The non-ionising energy loss in a crystal 
like silicon will be converted into phonons and create crystal defects. The 
Non-Ionising Energy Loss (NIEL) scaling hypothesis assumes that degra- 
dation phenomena in silicon detectors under irradiation are proportional 
to the NIEL in the material. It relies on the assumption that damage scales 
with the displacement energy. It does not consider the actual microscopic 
details of the damage for different particles and/or energy. 

Using the NIEL scaling hypothesis, we can scale the expected 
displacement damage caused by particles of different energy to that 
of one particular particle at a fixed energy. It is conventional to use 
1 MeV neutrons for this normalisation. Therefore all fluences will be ex- 
pressed as @eq, where this refers implicitly to 1 MeV neutrons. 
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The threshold energy for this process 

is calculated and the implications 

for radiation damage by low energy 
neutrons and electrons are considered in 
exercise 7. 


Figure 9.35: Mid-band energy levels. 


This is a good approximation for silicon 
but is much less reliable for compound 
semiconductors like GaAs. 


The unit used is often written as 
Neq/cm?, or (1 MeV) Neq/cm?. 
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In addition to the effects of bulk damage, radiation will create 
electron-ion or electron-hole pairs in semiconductors. The magnitude of 
the ionising dose is given by the energy absorbed per unit mass. As the en- 
ergy absorbed per unit mass depends on the atoms involved, a meaningful 
dose figure must specify the material. As the common material in silicon 
detector systems is silicon it is standard to use Gy(Si). Damage from ion- 
ising dose is usually not significant for the silicon sensors but it can be for 
the electronics or structural materials. 


LEAKAGE CURRENT AS A FUNCTION OF IRRADIATION 


Radiation damage will create mid-band states and therefore increase the 
leakage current. The increase in leakage current generated per unit sen- 
sor thickness as a function of the fluence @ in a volume V can be parame- 
terised by 

Teak = aov, 


where the linear rise reflects the NIEL scaling hypothesis. 

Annealing of the mid-band states will always reduce the leakage cur- 
rent, and higher temperatures accelerate the annealing. Temperatures dur- 
ing and after irradiation therefore need to be carefully monitored to make 
leakage current measurements comparable. 

The increase in leakage current due to irradiation has two serious im- 
plications for the operation of silicon detectors: 


e It will result in an increase in the shot noise (see eq. (3.4)), which 
will degrade the S/N; 


e As discussed in section 9.5, if the leakage power exceeds the heat 
removal capability of the thermal design, the sensor temperature can 
become unstable and the detector can go into thermal runaway. 


DOPING CHANGES 


Energy levels in the band gap close to the bottom of the conduction band 
will tend to have electrons promoted to the conduction band by thermal 
motion, thus leaving behind fixed positive charges. Similarly, energy levels 


(9.31) 
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Figure 9.36: Displacement damage 
function for different particles in 
silicon as a function of the particle 
energy [337]. 


The displacement damage function 
D(E) relates to the NIEL (in units of 
MeVcm"/g) as 

Na 


ma 


NIEL(E) = D(E), 
where N4 is Avogadro’s number and ma 
the molar mass of the material. 


The SI unit for ionising dose is the Gray 
(1 Gy = 1 J/kg). The older unit of Rad is 
also still used, 1 Gy = 100 Rad. 


The effects of radiation on the electron- 
ics have been discussed in section 3.9. 
Data on radiation hardness of different 
materials can be found in a series of 
CERN technical reports [471, 470, 134, 
340]. 


Also, the dependence of the leakage 
current on the instantaneous sensor 
temperatures as described in section 9.5 
needs to be taken into account. 
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Figure 9.37: Left: Leakage current increase as function of particle fluence for various silicon materials with different resistivities and conduction 
type. The current was measured after a heat treatment of 80 min at 60 °C and is normalised to the current measured at 20 °C. Right: Current-related 


damage rate & as a function of cumulated annealing time at different temperatures [378]. 


close to the top of the valence band will result in fixed negative charges. 
Defects created by radiation can combine with dopants to remove some of 
the fixed charges. 

The net result of these complex processes is that there will be a net 
change in the effective charge number density ner. The dominant effect 
in n-type silicon (initially with positive space charge) is to create acceptor 
states that can change the silicon from n-type to p-type due to irradiation. 
If the radiation levels are further increased, the effective charge density, 
which is now negative, will increase. This will result in an increase in the 
voltage required for full depletion (see eq. (9.4)) and if the applied high 
voltage cannot be increased sufficiently, there will be signal loss. For ex- 
ample, an initial n-type silicon type will invert to become p-type. We ob- 
serve an initial decrease in the full depletion voltage up to type inversion 
and then an increase. 
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The value of neg depends on the temperature and time as well as flu- 
ence, because of interactions amongst mobile defects. Over short periods 
of time after irradiation there is ‘beneficial’ annealing and the value of 
Neff Will decrease. Over long periods of time (years) there can be ‘reverse’ 
annealing which results in Neg¢ increasing. Both beneficial and reverse an- 
nealing happen faster at higher temperature. In general, in order to limit 
the growth of the full depletion voltage (Vpp) for a p*-in-n detector oper- 
ating for many years in a high radiation environment (e.g. an LHC silicon 
detector) it is essential to keep the detector cold once the detector has re- 
ceived a significant fluence. 


If we start with initial p-type silicon, 
there will be no type inversion, the value 
of Neg (and therefore the full depletion 
voltage) will increase monotonically 
with fluence, in accordance with the 
NIEL hypothesis. 


Figure 9.38: Left: Effective doping 
concentration (depletion voltage) as 
a function of particle fluence for a 
standard n-type silicon detector. Data 
were measured with a short annealing 
time. The data were extrapolated to 
zero annealing time and then used to 
fit a parameterisation of the depletion 
voltage versus fluence. The points 
shown are calculated values based on 
this 

parameterisation. Right: Evolution 
of the effective doping concentration 
as a function of annealing time. The 
data shown here were taken at room 
temperature while the annealing took 
place at 60 °C [378]. 


This complex annealing behaviour can 
violate NIEL scaling. This has been 
used to optimise the radiation toler- 
ance of silicon by adding impurities. 
These studies showed that the value 

of nest after irradiation was lower for 
detectors with higher oxygen concentra- 
tions [336]. 
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CHARGE TRAPPING 


Moving charge carriers (which generate the induced signal in our detec- 

tors) can be captured by trapping levels. If they are not released quickly ie. in a time shorter than the integration 
. . : . A : 5 time of the detector readout. 

enough this will result in a loss of signal. If the mean trapping time is Teff, 

then the signal will decrease with time exponentially 


t 
Q(t) = Qoexp (-2) ; (9.32) 
Teff 
Values of Te have been measured separately for electrons and holes 
as a function of fluence. The variation with the fluence @ can be simply 
parameterised as 


1/ tere = 1/1 + BO. (9.33) 


For irradiated detectors, the initial value of 1/ oe is usually negligible. 


= T <= aL T Figure 9.39: Inverse trapping time as 
4 0.5} 24 GeV/c proton irradiation A function of particle fluence as measured 
T i at 0 °C after an annealing of 30 to 
z 04+ e data for electrons ae E 60 min at 60 °C. [378]. 
data for holes ee 

oO pa 

g 0.3} m 4 

=) 

F 02 ae 

a i aw | 

a Foa 

5 0.17 9i J 

d ie) 

A w 

3 ols 

—& 0 210% 4104 610 810 10" 


particle fluence - ®,, [em?] 


For very high fluences (> 5 x 10!4 Neq /cm?) the loss of signal is the 
limiting factor in the operability of silicon detectors. As electrons have 
higher mobility than holes in silicon, they will have shorter charge collec- 
tion times. Therefore electrons are less sensitive to charge trapping than 
holes. This makes n*-in-p sensors less affected by charge trapping than 
p*-in-n for high fluences (~ 10!° Neq /cm”). This is the reason why AT- 
LAS and CMS have decided to use n*-in-p rather than p*-in-n silicon 
detectors for the upgrades for HL-LHC. 
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PREDICTION OF RADIATION DAMAGE EFFECTS IN SILICON 


We have seen that radiation damage effects have a serious impact on the 
long-time operability of silicon detector systems, with the ultimate loss 
of S/N. It is therefore essential for the design of silicon detector systems 
in high intensity environments to be able to predict the radiation damage 
effects, in particular in response to the temperature history of the system 
and the profile of fluence over the lifetime of the detector. 

Particle fluences at different locations in an experiment can be 
simulated with advanced Monte Carlo event generators and particle 
transport codes as implemented in GEANT, and can be translated into 
1 MeV neutron equivalent fluences using the NIEL hypothesis. The leak- 
age current can be predicted from radiation damage parameterisations, 
like the Hamburg/Dortmund model [377, 320] or the Sheffield-Harper 
model [281]. 


Such studies have been made for the silicon trackers for ATLAS, CMS 
and LHCb, and compared to measured leakage currents, together with data 
on instantaneous luminosity and device temperature as a function of time, 


to check the validity of the models and to verify the long-term prognosis 
of performance for the systems in the high-intensity LHC environment. 
Excellent agreement has been observed. 
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The prediction models can also be 
combined with models of the thermal 
behaviour as described in section 9.6. 


Figure 9.41: Comparison between data 
(points) and Hamburg/Dortmund model 
predictions (lines with uncertainties 
shown by the coloured bands) of the 
leakage current per unit volume scaled 
to a sensor temperature of 0°C of 

the four layers of the ATLAS barrel 
SCT. The integrated luminosity and the 
average sensor temperatures are also 
shown. The blue shading and label ‘HT’ 
indicate periods of heavy-ion running, 
while extended periods with no beam in 
the LHC during which the SCT was off 
are shaded grey [78]. 


Figure 9.42: The effective depletion 
voltage against fluence for sensors for 
the LHCb VELO detector [27]. The 
depletion voltage from data is compared 
to the predictions by the Hamburg 
model. Note type inversion at about 

105 Neq/cm. 
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9.12 3D SENSORS 


A silicon detector design aimed at improving the radiation tolerance is the 
3D sensor. Here the heavily doped implant regions are not on the surface 
as in the planar sensor, but penetrate as pillars through the thickness of 

the sensor. The spacing of the pillars is smaller than the thickness of the 
wafer. In both approaches the silicon sensor can have the same thickness, 
and therefore the MIP signal before irradiation will be similar. However, 
after doping changes due to radiation damage the planar sensor will re- 
quire too high a bias voltage to be fully depleted and the charge trapping 
will also reduce the signal. In the 3D sensor the required depletion depth is 
smaller and the drift distance is shorter. 


© electrons 
@ holes 


The other important advantage of 3D sensors is that they can have ‘ac- 
tive edges’, unlike planar detectors which need some type of guard ring 
structures to avoid leakage current from the sensor edges. 

The fabrication of 3D sensors is more complicated than that of pla- 
nar sensors. The 3D sensors require a process called Deep Reactive Ion 
Etching (DRIE) to form the columns [315]. In one approach to DRIE a 
short pulse of a suitable plasma is used to etch the silicon isotropically. 
The walls of the resulting hole are then passivated with a gas such as CyFg. 
When a second plasma etch is applied the etching is only in the vertical 
direction because of the passivation at the sides. Therefore the two steps 
can be repeated multiple times to create the columns that are the full depth 
of the silicon. Thermal-diffusion steps are then used to drive in dopants to 
form the n* and p* electrodes. 

As 3D sensors are more radiation tolerant but more expensive than 
planar sensors, it makes sense to use them for the innermost layers in a 
collider detector. 3D sensors were used for the highest radiation area of 
the innermost pixel layer of the current ATLAS tracker [217] and will 
be used for the innermost barrel layer for the ATLAS tracker for the 
HL-LHC [283]. 
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Figure 9.43: Schematic design of a 3D 
sensor. 


See for example [316] for an overview 
of the process steps involved. 
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9.13 OTHER SEMICONDUCTORS 


Although silicon has many properties that make it the best semiconduc- 
tor for tracking detectors, the low atomic number (Z = 14) makes it less 
suitable for high resolution X-ray detectors. Germanium is better suited 
for this purpose, as it has a higher atomic number (Z = 32). Germanium 
has a much smaller bandgap than silicon, Eg = 0.67 eV. This results in 
one complication in the use of Germanium detectors compared to silicon: 
even without radiation damage they need to be cooled. Compound semi- 
conductors are also an attractive material for X-ray detectors. CdTe and 
Cd ,_,Zn,Te (abbreviated as CZT) have high atomic numbers (Cd: Z = 48, 
Zn: Z = 30, Te: Z = 52) and a large band gap (1.44 and 2 eV, respectively), 
but they are challenging to produce in the required purity, and the size of 
these crystals which can be grown is still limited to a few cm. 

The radiation damage effects seen in silicon have prompted research 
on alternatives for the use in tracking detectors. One alternative semicon- 
ductor considered was GaAs (Ga: Z = 31, As: Z = 33). Initial radiation 
damage studies using low energy neutrons showed superior radiation re- 
sistance. However, further studies with higher energy hadrons showed 
a reduction of the charge collection efficiency (CCE) due to trapping of 
electrons and holes [116]. 

One benefit of GaAs is the very high mobility for electrons 
(8500 cm?/Vs), which makes GaAs attractive for timing applications, 
and allows for thinner sensors, but it requires n*-in-n or n*-in-p geome- 
tries [419]. Another advantage of GaAs is that it has a direct band gap, 
which means that it absorbs and emits light efficiently. 

Other semiconductors that are investigated for radiation-hard detectors 
are SiC or group II-V compounds like GaN [453]. The challenge with 
these materials is the production of thick enough sensors and fabrication 
methods are not yet mature for large scale detectors. 

Another idea for very high radiation tolerance is to use diamond. Dia- 
mond has a bandgap of 5.5 eV, and is therefore considered to be an insula- 
tor. The average energy required to create a electron-hole pair is 13.1 eV, 
a factor of about four larger than for silicon. The resulting signal is also 
reduced by the very short achieved carrier lifetimes of @(100 ns). 

As diamond is an insulator, there is no need for a bias voltage to de- 
plete the substrate and as the bandgap is so large, there is no need for 
cooling even for irradiated detectors. Radiation damage does result in the 
shortening of the mean free path of charge carriers, but the rate is smaller 
than in silicon. Crystalline diamond can be made using Chemical Vapour 
Deposition (CVD), but the production of material with sufficient thickness 
and uniformity is challenging, and this technology is still very expensive. 
Diamond detectors have been used at the LHC for very small systems re- 
quired in the highest radiation regions for beam protection monitoring and 
luminosity measurements. 
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x represents the fraction of Zn in the 
compound. 


GaAs has a wide band gap of 1.41 eV, 
and a relatively low electron-hole pair 
creation energy (4.2 eV). 


GaAs has the highest efficiency for 


photovoltaic conversion of light to 


electricity. 


This is not very significant for fast LHC 
detectors. 


For a summary of the performance 
achieved see [18]. 


The losses from the very high intensity 
beams hitting the collimators can poten- 
tially result in dangerously high fluxes 
of secondary particles that can damage 
the detectors. In order to minimise the 
risk of damage to the silicon detectors 
the beam protection system enables the 
beams to be aborted if the fluxes are too 
high. 
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Key lessons from this chapter 


e The fundamental idea of the most common silicon detectors is to use a reverse bi- 
ased pn junction to create a high electric field region depleted of free carriers. A fast 
charged particle passing through such a detector creates a large number of electron- 
hole pairs which drift in the electric field. The resulting induced current is detected 
and amplified. 


e A depletion depth of about 300 um results in sufficient signal, so that no internal 
amplification is required. 


e Internal amplification can be achieved by introducing another doping layer. Gains 
from a few tens to 10° and higher can be achieved. Internal amplification can be used 
to reduce sensor thickness, for improved timing performance and detection of photons 
(even single photons). 


e The standard silicon detector configuration is p*-in-n. n*-in-n give higher radiation 


hardness, at the cost of increased manufacturing complexity. Very good radiation 
tolerance can also be achieved with n*-in-p sensors which are easier to manufacture. 


e The reverse biased diode junction displays a thermally generated leakage current. The 
leakage current causes shot noise, and the heat generated by the leakage must be re- 
moved by a cooling system, as the system can otherwise become thermally unstable. 


e Bulk radiation damage results in the creation of energy levels in the forbidden gap. 
e This will result in significant increase in the detector leakage current, therefore 
it is essential to operate irradiated detectors cold. 


e The defects will also change the effective charge density in silicon and therefore 
change the required full depletion voltage. 


e The effects of charge trapping will reduce the signal after a very high radiation 
fluence. This is the ultimate limit on how large a fluence can be tolerated. 


EXERCISES 


1. Charge carrier density in pure silicon. 


Undoped pure silicon has an intrinsic charge carrier density of n; ~ 10!° cm~? at room temperature 
(see section 9.11). 


a) Find the number of electron/hole pairs per unit thickness in a detector element with an area 


of 1 mm?. 


b) Compare this with the ionisation produced by the passage of a minimum-ionising charged 
particle and comment on the realisability of a practical detector (density of silicon: 
p = 2.33 g/cm’, energy required to produce an electron/hole pair: E = 3.6 eV). 


2. Position resolution of a strip detector. 


a) For a silicon microstrip detector with strips of 80 um width and 120 mm length, assuming 
binary readout, what is the resolution in the strip direction (z) and the orthogonal direction 
(rọ). Why might the resolution in practice be better than indicated by the simple estimate? 
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b) Improved resolution in the z direction can be achieved by using two back to back sensors 
with their strips running at different angles. The angle between the strip directions in the two 
sensors is called the stereo angle. Show with the aid of a simple sketch how if there are mul- 
tiple hits on a sensor pair, we can reconstruct the correct hit location, but there will be addi- 
tional false locations found in a high-rate environment (‘ghost hits’). 


c) How does the z-resolution and the rate of ghost hits depend on the stereo angle? For a col- 
lider detector, why are the resolution requirements different for the z and r@ directions? 


3. Properties of a pn junction. 


a) Perform the integral of the distribution of the number of electron states as a function of en- 
ergy n(E) to verify the result of eq. (9.1), justifying any approximations you make. The non- 
relativistic density of states is given by 


b) Derive eq. (9.3). 
c) Verify the results in figure 9.2. 


4. Fermi energy in silicon. 


a) Find an expression for the Fermi energy Ep in an intrinsic semiconductor, in terms of Ee, Ey 
and the effective masses of negative and positive charge carriers. 


b) Repeat the calculation of part a) but now for a doped semiconductor with a volume density of 
donor dopants of N4. 


5. Pulse shape for a simple pad detector. 


Consider a simple silicon p*-in-n pad detector. Consider MIPs crossing the detector orthogonally. 
The detector is 300 um thick and the acceptor density is na = 10!* cem-?. 


a) Ignoring the built-in voltage, what is the full depletion voltage? 

b) The mobility of electrons and holes are 450 cm?/(Vs) and 1400 cm?/(Vs). Assuming elec- 
tron/hole pairs are created uniformly along the path of the MIP, what is the induced current 
with time in a pad detector? 

c) Calculate the longest charge collection time for electrons and holes. 

d) Now assume that the detector is ‘over-depleted’ by 100 V. Sketch the resulting electric field 
in the detector and compare it with that of the just-depleted case. Calculate the pulse shapes 


for electrons and holes for the over-depleted case and compare this to the results of the just- 
depleted detector. 


6. Time resolution. 
Consider a detector with an intrinsic rise time fg, read out with an amplifier with a rise time fg. 


a) What is the rise time at the output of the amplifier? 


b) The noise output of the amplifier is oy = eA f, where e? is the noise spectral density and Af 
is the noise bandwidth. The noise bandwidth Af = a/ta, where a is a constant. Show that the 
timing jitter 
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c) Hence show that the optimal time resolution is obtained when ta = fg. 


. Displacement damage to silicon lattice. 


The energy imparted to a silicon atom needs to be at least 25 eV for the silicon atom to be dis- 
placed from its location in the lattice. Determine the minimum kinetic energy required for this 
process for (a) a proton, (b) an electron and (c) a silicon atom. 


Comment on the implications of these results on the displacement damage. 


. Leakage current and shot noise. 


Consider a silicon microstrip detector with strip widths of 80 um, length 120 mm and thickness 
300 um. Calculate the leakage current in one strip for two cases: 


a) For an unirradiated silicon detector with a generation lifetime of Tọ = 1 ms. 

b) After an irradiation with a fluence of beg = 1014 negem ~? (ignoring annealing effects). 

c) Estimate the shot noise for these two cases assuming a shaping time of t = 10 ns and consid- 
ering the silicon temperature to be —20 °C, 0 °C and +20 °C. 
Compare these values with the signal expected from a MIP traversing the detector (the aver- 
age energy required to create an electron-hole pair in silicon is 3.6 eV and the energy loss for 
MIPs in silicon is 1.664 MeV g7 !cm? and the density is 2.329 gem™°). Discuss the signifi- 
cance of these results for the operation of silicon detectors. 


Linear model of thermal behaviour of silicon systems. 


a) For the linear model discussed in section 9.6 and based on the condition for thermal stability 
shown in Figure 9.17, show that for the critical sensor temperature Tg crit the following rela- 
tion holds: 

Tei + (Ta — 2To)Ts,crit — Ta To = 0. 
Hence show that eq. (9.24) applies. 
b 


eS 


Sketch the relation between the sensor temperature and the coolant temperature described 
by eq. (9.25). What is the slope at the end of the stable region? Invert this relation, and find 
eq. (9.26), which gives the critical sensor temperature if the leakage power Qef at a tempera- 
ture Tief is known. 


Show using this model that to maintain thermal stability at end of life the local sink tem- 
perature for the ATLAS SCT (R, = 0.945 K/W, R: = 2.164 K/W, Qn = 6 W, sensor area 
163 cm?) for an expected sensor leakage power of 2 W at 0 °C must be at most —17 °C. 


c 


wm 


d) How much would the coolant temperature have to change if the leakage power doubles? 


Responsivity of a photodiode. 


What is the reflectivity of an air-silicon boundary at a wavelength of 850 nm? If a photodiode 
uses an anti-reflective coating to reduce the reflected power to be negligible compared to the in- 
cident power, what is the minimum thickness of the active i layer to ensure that the responsivity 
R > 0.5 A/W? You may assume that all converted photons contribute to the signal. 


The index of refraction for Si at 850 nm is 3.65. The attenuation constant for silicon at 850 nm is 
535 cm™!. 
Depletion voltage and resistivity. 


We wish to be fully deplete a p-in-n silicon detector with a thickness of 300 um at a bias voltage 
of 100 V. 
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a) Show that the resistivity is related to the carrier density n and mobility u (assume a linear 
relationship between drift speed and electric field strength) by p = 1/(neL). 


b) Neglecting the ‘built-in’ voltage of the diode calculate the carrier density required to fully 
deplete the sensor at a bias voltage V. Hence estimate the minimum resistivity of the n-type 
silicon required. 


c) Why would the minimum resistivity for n*-in-p detectors be larger? 


The relative permittivity of silicon is 11.7 and the electron mobility is 1400 cm?/(Vs). 


0 Tracking 


Tracking detectors are designed to reconstruct the trajectories of charged 
particles as they traverse the tracking volume. Usually a magnetic field 
will be present in this volume. In this field, the trajectories will curve and 
we can reconstruct the momentum of the charged particles from the curva- 
ture of the track. From the direction of the curvature we can determine the 
sign of the charge of the particle. 

After reconstruction of the trajectories of the charged particles, we 
can reconstruct the ‘primary’ vertex. In a collider experiment, this will be 
where the beam particles interacted, in a fixed target experiment, this will 
be where the beam particle interacted with the target. Particle trajectories 
not coming from the primary vertex can be associated with secondary and 
tertiary vertices. These can arise from the decays of long lived particles 
like hadrons containing b or c quarks, or T leptons. 

A sudden change in direction (‘kink’) of the charged particle can also 
indicate a decay of the original charged particle, for example a decay like 
T —> UVy, Or an electron emitting bremsstrahlung. 

If the energy deposited by the charged particle for each spacepoint is 
recorded, we can combine these measurements to estimate the energy loss 
dE /dx and use this for particle identification (see section 12.3). 

The reconstructed tracks can be extrapolated into the electromagnetic 
calorimeter and compared to the location of energy depositions there. This 
is used in electron identification (see section 12.7). The trajectory of the 
tracks can also be extrapolated to ‘muon’ chambers outside the calorime- 
ters. This can be used to improve muon identification and measurement 
(see Section 12.8). 

Fast track reconstruction has been used in some detectors for low level 
trigger decisions. In general, track reconstruction can be used in high level 
triggers (see chapter 13). 

Tracking detector designs will be very dependent on the particular ac- 
celerator at which they are employed. Tracking detectors for low energy 
collisions will emphasise reducing the material in the detector to min- 
imise multiple scattering. For high energy collisions such as at the LHC 
it will be necessary to have a large enough tracking volume to measure the 
momenta of high momentum particles with sufficient precision. For de- 
tectors operating with a large number of interactions per bunch crossing 
(‘pile-up’) the detector needs to have sufficient granularity to be able to 
reconstruct trajectories in very dense track environments. This implies that 
we need detectors with sufficient granularity to ensure that the fraction of 
elements hit (‘occupancy’) is sufficiently low. 

We will start this chapter with a discussion of the finding and recon- 
struction of tracks. We will also review the incorporation of precise tim- 
ing measurements in the track reconstruction (‘4D tracking’). An impor- 
tant ingredient for the connection of space-points is the knowledge of the 
position of the individual detector components, which we refer to as the 
‘alignment’. We will then discuss the measurement of the momentum of 
a charged particle from the curvature of its track in a magnetic field and 
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The space points used in track recon- 
struction usually come from a combi- 
nation of gaseous (chapter 7) and/or 
semiconductor detectors (chapter 9). 
Space points can also be measured in a 
liquid TPC (chapter 8). 


For tracking detectors using a few high 
precision layers of silicon detectors, 

a rule of thumb is that the occupancy 
should be below 1%. For tracking 
detectors using many more layers of 
lower precision gas chambers much 
higher occupancies can be tolerated. 
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we will look at common geometries for spectrometer magnets. Finally, we 
will discuss the reconstruction and measurement of interaction vertices. 


10.1 TRACK RECONSTRUCTION 


A powerful way to find the kinematic properties of charged particles even 
in an environment with many particles is the measurement of the position 

at multiple locations along the trajectory, and the combination of the indi- 
vidual space-points to a track. Depending on the detector used, the tracks 

are typically reconstructed from either a limited number of points with ex- 
cellent space point resolution (typical for silicon trackers), or many points 
with a somewhat poorer resolution (e.g. in a TPC). Independent of the na- 


ture of the track, the usual steps in the reconstruction are For a detailed review of pattern recogni- 
tion and track reconstruction see [253]. 


1. Find combinations of space points that are likely to belong to one 
track (‘pattern recognition’); 


2. Find for each track the track parameters, and calculate the associ- 
ated covariance matrix to estimate the errors on these parameters. 


This is usually done by a fit of a track model to the data. In a homogenous magnetic field the 
track model will be a helix. In more 


i x i r 7 complex magnetic fields the track 
3. Test the track hypothesis, and identify outliers and wrongly assigned model can be found from stepwise 


points and remove them from the track fit. This includes detection numerical integration (for example 
$ š í ie ag 2 using the Runge—Kutta—Nystrém algo- 
of kinks in the track, which are indicative of a decay vertex. rithm [347]). 


The first track finding step can be performed with global or local meth- 
ods. A popular example of a global method is the Hough transform [253]. 
A simple demonstration for a Hough transform can be given for the case 
of hits lying along a straight line y = cx +d. The hits are transformed from 
‘image’ to ‘parameter’ space using d = —xc +y. This results in lines in pa- 
rameter space which overlap at the correct values for c and d in parameter 
space. Therefore values of (c,d) corresponding to candidate tracks can be 
found from peaks in the 2D histogram of c versus d. A general discussion 
of local methods will be given below, and a specific example in the case of 
ATLAS will be discussed. 

For very complex events, e.g. in high pile-up at LHC, there are several 
complications. One constraint is the limited computer resources, so one 
approach is to use a limited number of layers to look for ‘seed’ tracks and 
then extrapolate the seed tracks to find additional hits on the track. After 
candidate tracks have been found there will be many cases of hits shared 
between different tracks. An algorithm is needed to assign these ambigu- 
ous hits to the best track. After this process, some tracks will have too few 
hits and will be rejected. Therefore there will be a trade-off between the 
track finding efficiency and the number of spurious tracks. 

The quality of the track parameters will be affected by measurement 
errors of the particle position in the different detector layers, errors on 
the knowledge of the position of the sense elements with respect to each 
other and a global coordinate system, and disturbances of the particle’s 
path, like for example by multiple scattering, decays and energy loss of the 
particle. Track reconstruction is complicated in high-rate environments by 
ambiguities in the assignment of individual hits to specific tracks. 
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In principle, a simple least square fit of the track model to the mea- 
sured datapoints gives the optimal estimate of the track parameters (if the 
measurements are strictly linear functions of the track parameters, and 
all stochastic disturbances are Gaussian). However, in practice the least 
squares method requires inversion of large matrices. The most common 
approach in track reconstruction is therefore to use a Kalman filter [252], 
which is a two-step recursive method, consisting of a prediction, where the 
track hypothesis is extrapolated to the next detector layer using the track 
model and the track parameters determined up to this point, and an up- 
date, where the information from this layer is used to update (and ideally 
improve) the estimate of the track parameters. This sequence is repeated, 
until all the detector layers are included, or if no suitable hits are found 
over several layers. After this forward filtering backwards smoothing is 
performed [253]. 


TRACKING IN LHC INNER DETECTORS 


Tracking is very challenging for the general purpose LHC experiments 
because of the very large pile-up. It is particularly challenging in the core 
of jets, where the particle density is significantly higher. We will briefly 
review some of the ideas used for tracking in the ATLAS ID to illustrate 
how tracking is performed there. 

The primary pattern recognition uses an ‘inside-out’ algorithm. 3D 
space points from the precision silicon detectors are reconstructed in a 
global coordinate system. Candidate track segments (seeds) are formed 
from combination of hits in the pixel detector. The seeds are then used to 
form ‘roads’ and these are used with a Kalman filter to successively add 
hit coordinates to the track. At each stage ‘outliers’ are rejected using a 77 
cut on the fitted track segment. This process results in a very large num- 
ber of ambiguities in the resulting track candidates. The ambiguities are 
resolved by ordering the tracks according to a measure of quality. This 
uses the fit %? but it also includes information about whether particular 
layers were missing hits. Hits that are shared between tracks are gener- 
ally assigned to the track with a higher score and the quality score is re- 
evaluated. At the end of an iterative process, track candidates with a low 
score are discarded. The candidate tracks are then extrapolated into the 
straw tracker (TRT) and compatible hits there are added to the track. 

This inside-out tracking can fail to find tracks that arise from decays 
or conversions. Therefore an outside-in tracking stage is also performed 
to recover this class of tracks. The track finding in the TRT uses Hough 
transforms in the two 2D views. Global track candidates are simply found 
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A least square fit minimises 
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where the x?®® are the measured po- 
sitions in 1, 2 or 3 coordinates, © (xi) 
are the uncertainties on the measured 
positions and rodel are the positions 
predicted by the track model for the set 
of parameters p, and the sum runs over 
all measured points. 


Due to the iterative approach outliers 
can be rejected at each step, and the 
algorithm can be used to find the most 
plausible direction of the track in the 
case of ambiguities by branching out 
track hypotheses at ambiguous points 
and choosing the most plausible com- 
bination of hits at the end (determined 
from the x). 


Figure 10.1: Example for the use of 

a combinatorial Kalman filter for the 
track reconstruction in the HERA-B 
outer tracker [357]. The track recon- 
struction proceeds from the right to the 
left and the black triangles indicate 

the track parameters at each step. The 
reconstruction has been seeded with a 
first estimate on T1. The detector is a 
drift chamber with hexagonal drift cells 
and the hits are indicated by circles with 
a radius corresponding to the measured 
drift time. 


This is straightforward for the pixel 
detectors but for the SCT microstrips 
this requires matching hits on the two 
stereo sides of a module. 


The outlier is rejected, if the 4° after 
inclusion into the track is larger than a 
threshold value. The threshold depends 
on the number of degrees of freedom 
and the probability to falsely reject a 
genuine hit [252]. 
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by looking for peaks in the 2D histograms in the transformed parameter 
space. These track candidates are used to seed another Kalman filter to 
find track segments in the straw chambers of the TRT. These tracks are 
then extrapolated into the silicon trackers to add any hits on the track that 
were not already used by the inside-out tracker. This procedure then re- 
covers the tracks from photon conversions and decays that might have 
been missed by the inside-out tracking. 

The tracking is particularly challenging in the core of high-pr jets 
where the track density is very high. In this environment there are often 
cases of clusters in the silicon detectors which are assigned to multiple 
track candidates. The efficient resolution of these ambiguities is the key 
to maintaining good performance in dense environments. For the case of 
pixel clusters, a neural network is trained to find merged clusters. The in- 
put uses the measured charge in the pixels and the angles of the candidate 
tracks crossing the cluster. 

The tracking efficiency for muons can be estimated from Monte Carlo 
simulations. The efficiency varies with muon 7) and pr. In order to min- 
imise systematic uncertainties associated with inaccuracies in the simula- 
tion, data-driven methods are used to correct the simulation. This is done 
using the ‘tag and probe’ method, investigated in exercise 6. 

It is more difficult to make a data-driven efficiency measurement for 
pions and therefore this is usually estimated using Monte Carlo simulated 
samples. An example plot showing the efficiency as a function of the an- 
gular separation between the track and the jet axis [79], AR is shown in 
Figure 10.2. We can see that there is a significant decrease in efficiency in 
the core of a jet which is due to the very high local particle density. The 
loss of efficiency becomes more significant as the jet-pr increases because 
of the increase in multiplicity of jets with pr and because the jet becomes 
more collimated. The efficiency ‘plateau’ at large values of AR is signif- 
icantly below 100%. This is mainly due to hadronic interactions in the 
material of the inner detector. The efficiency for muons which do not un- 
dergo these interactions and are not in the core of jets is very much closer 
to 100%. 
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Electrons can undergo large energy loss by bremsstrahlung and the 
radius of curvature of the electron in the magnetic field will decrease when 
this happens. The classic Kalman filter approach cannot accommodate this 
sudden change. In these cases ATLAS uses a modified Kalman filter that 
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The TRT uses longitudinal and radial 
straw tubes in the barrel and endcaps. 
Therefore, the barrel (endcap) provides 
information in the transverse r — @ 


(z—@) view. 


We can use the same type of tag and 
probe methodology to make many other 
measurements such as trigger and 
electron finding efficiencies. 

There is no low-background source of 
di-pion events. 


AR = \/(A@)? + (An)?, where Ad 
is the distance in the azimuthal angle 
between the track and the jet axis, 
with an equivalent definition for the 
pseudorapidity interval An. 


Figure 10.2: The track finding efficiency 
as a function of angular separation from 
the jet axis, AR for jets in three different 
momentum ranges [79]. 


The probability will be significant in 
any tracking detector which has O(1) 
radiation length. 
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allows for up to 30% energy loss at each material intersection [80]. Finally 
for tracks loosely matched to electromagnetic clusters, a Gaussian sum 
filter (GSF) algorithm can be used to obtain optimised estimates of the 
track parameters. 


10.2 4D TRACKING 


Traditional particle detectors reconstruct the tracks of particles accurately 
in space but not in time. An ideal detector would provide accurate infor- 
mation in 3 spatial and 1 time coordinates, which would allow full 4D 
tracking. As we have discussed in chapter 9 the pixel and strip detectors 
that provide precise spatial resolution have modest time resolution. On the 
other hand, silicon detectors that have been optimised for very good time 
resolution (for example LGADs, see section 9.9) use pad detectors giving 
very modest spatial resolution, and these are going to be introduced into 
the LHC general purpose experiments. 

At the general purpose LHC experiments the number of interactions 
in the same bunch crossing (u) is very large at peak luminosity. We wish 
to reconstruct the particles from the ‘hard-scatter’ that triggered the event 
and not be affected by the particles from pile-up. The jet reconstruction 
can result in fake jets from particles from pile-up. If we can resolve the 
different primary vertices, we can reject these pile-up jets by requiring 
that a large fraction of the tracks associated to a jet are from the primary 
vertex. This works well at LHC but would not be effective at HL-LHC. 
At HL-LHC the maximum pile-up is u = 200 interactions, and the mean 
density of vertices along the beam direction will be around 1.4 mm7 !, 
with fluctuations up to a density of 3 mm~! [82]. This figure is too high 
for the tracking detectors to efficiently separate these vertices. 

The effects of pile-up can be mitigated by the addition of a timing 
layer in the forward and backward directions. Tracks from very closely 
spaced 3D vertices can be efficiently separated if there is timing infor- 
mation. This can be seen from the following simple example: the vertex 
location (zo) for collisions between particles at the ‘head’ of the bunches 
in both beams will result in the same value of zg as for collisions between 
particles at the tail of the bunches in both beams. However, the time of the 
interaction will be earlier for the first case. 

A sufficient separation of vertices can be achieved with a timing res- 
olution per track of 30 ps, which can be achieved with LGADs (see sec- 
tion 9.9). 

A more ambitious use of 4D detectors has been proposed for the 
LHCb Upgrade II, with the aim of creating a full 4D tracker [399]. Differ- 
ent technologies are being considered for the various systems. 3D silicon 
pixel detectors are a very attractive option for the detectors closest to the 
beam pipe as they have the best radiation tolerance and can have thicker 
active depths and thus give larger signals which improves the timing reso- 
lution. The aim is to achieve a few tens of ps resolution per particle. This 
would allow the same performance to be achieved as for Upgrade I but for 
a further increase in luminosity by a factor of 7.5. 
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GSF is a non-linear generalisation of 
the Kalman filter. 


The LHC tracking detectors had just 
enough time resolution to assign hits to 
the correct bunch crossing. 


ATLAS measured a luminosity weighted 
average value in 2018 of u = 36.1 [81]. 


The resolution in the longitudinal im- 
pact parameter is worse in the forward 
directions. 
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10.3 ALIGNMENT 


To correctly link the space points recorded in different parts of the track- 
ing system, the position of all detection elements in a global coordinate 
system must be known to a level that it does not introduce a significant 
degradation of the spatial resolution beyond the intrinsic element resolu- 
tion. Several approaches are possible. 


Build accuracy 


During assembly, the position of the detection elements can be controlled 
by precision machining (in reductive or additive manufacturing), or by 
precision moulds or jigs (for composites or gluing). However, the typically 
required accuracy can usually only be achieved for items of limited size. 


Assembly surveys 


After construction, the position of detector elements can be measured, 
either by an optical or mechanical survey, or using X-rays. 

For both these approaches, precision build and/or assembly survey, 
the limitation is that deformations of the geometry after the build, whether 
during subsequent stages of the installation or due to mechanical or ther- 
mal loads during the operation of the detectors, are not captured. However, 
for objects of limited size and high internal stiffness assembly-based spa- 
tial information can remain accurate, so that the information can be useful 
to reduce the number of parameters to be determined by other alignment 
methods. 


Hardware alignment 


During operation deformations of the mechanical structures supporting 
the detection elements can be tracked by hardware alignment systems. 
Such systems are typically optical, and can achieve sub-um absolute ac- 
curacy, although the accuracy will degrade with the size of the system, and 
be worse for large muon systems. Examples for measurements in the di- 
rection of the optical axis are Frequency Scanning Interferometry (FSI) as 
used in the ATLAS SCT [208], or for measurements perpendicular to the 
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Figure 10.3: Simulation of an event 
with a hard scatter (at z ~ 15 mm) 
and 200 pile-up interactions. The plot 
shows vertex position z versus time. 
The dashed vertical lines represent the 
positions of reconstructed vertices and 
the ellipses represent the positions of 
‘truth’ vertices. The size of the ellipses 
represent the approximate expected 
resolution in z and time, 1 mm and 30 ps 
respectively [82]. 


For a semiconductor tracker this re- 
quirement can be O(1 um). 


For an example of the use of X-ray 
tomography in the survey of large 
wirechambers see [449]. 


In a magnetic spectrometer changes in 
the state of the magnetic field can also 
be a significant source of deformations. 


In a magnetic spectrometer optical 
alignment beams can be interpreted as 
infinite momentum tracks. 
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optical axis the Red Alignment System Nikhef (RASNIK) technology as 
used in the ATLAS muon system [266]. Standalone optical alignment sys- 
tems have the disadvantage that they require a step of extrapolation from 
the position of the optical elements to the detector elements. This can be 
overcome by using the detector elements themselves for the detection of 
the optical alignment reference as done for (some set of the modules of) 
the CMS tracker [457]. 


Track-based alignment 


In track-based alignment (also known as software alignment) a high- 
quality subset of the data recorded in actual physics events is used to re- 
construct the positions of the detector elements offline. The optimal soft- 
ware alignment algorithm uses the global y? fit approach. A 7? is cal- 
culated as the sum of the residuals for all the hits in the sample of high- 
quality track fits and the optimal alignment constants are obtained by 
minimising this 77. This technique has been used in the alignment for 
the inner trackers of the LHC experiments (see for example ATLAS [83], 
CMS [198]) with great success, demonstrating that the position of the 
modules in these large systems can be aligned at the um level, and that 
software alignment is even capable of reconstructing the shape of individ- 
ual detector modules. 

The number of degrees of freedom for a large silicon detector system 
can be several 10°, and it is typically beneficial to introduce a hierarchy of 
sub-structures, which matches the actual hierarchy of mechanical support 
structures, starting at large units (barrels, endcaps, etc.) with limited de- 
grees of freedom, and becoming progressively more detailed. Experience 
shows that movements of large units are most frequent, while internal de- 
formations of these units are usually small, so that internal realignment is 
required infrequently (only every several months). 

One of the challenges in the more refined levels of alignment is that 
the global x? fit leaves certain classes of coherent degrees of freedom 
(twists, telescoping, etc.) undefined, with more or less arbitrary position 
parameters. These deformations are called ‘weak modes’, and they need 
to be constrained by other means (cosmics, higher-level analysis like mass 
peaks, etc. and their invariance in different directions). 

Probably the most important of these weak modes is curl because 
this can result in equally good track fits but with very different values of 
the reconstructed pr and even result in charge asymmetries. These weak 
modes can be very well constrained by cosmic rays (mainly muons) in the 
barrel region that pass through the entire detector. The reconstruction of a 
common track with one value of pr very strongly constrains the curl. An- 
other powerful technique to constrain the curl is to measure the energy E 
and momentum p of electrons. Any curl will bias the distribution of E /p. 

Other weak modes are described in [235]. Several types of measure- 
ments can be used to constrain these modes including the reconstructed 
masses of decays like K? — m+ m~ and reconstructing beam gas events. 

Software alignment is the most powerful in the direction where the 
tracking layers are the most precise (typically in the detector plane for 
planar detectors); it is less capable in the direction perpendicular to these 
layers. 
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For a more detailed discussion of the 
alignment of the ATLAS muon system, 
see [23]. 


A high-quality subset comprises typi- 
cally high-momentum tracks with small 
errors on the track fit parameters. 


If information from a hardware align- 
ment system or tracks from cosmics are 
available they can be included in the x7 
fit. 

Track-based alignment works best with 
detector systems that have a high level 
of internal stiffness and high point 
position capability. It is working so 
well in the LHC silicon trackers that the 
dedicated hardware alignment systems 
described above are not required to 
determine sensor positions. 


Curl refers to correlated rotations in 
the azimuthal angle of different radial 
layers in such a way that the particle 
trajectories still give good helical fits. 


Cosmic rays are mainly vertical and 
thus cross the barrel layers of a collider 
experiment. In the endcaps, detectors 
are typically perpendicular to the beam 
line and therefore the cosmic rays do 
not cross many endcap layers. For the 
endcap detector, beam halo tracks can 
be used instead. 


Naively for ultra-relativistic electrons 
E/p = 1. However the measurements 
are severely affected by interactions like 
bremsstrahlung in the detector material 
which biases E / p. Therefore a very 
accurate knowledge of the detector 
material is essential for this technique 
to work. 
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The most important structural design requirement to support track- 
based alignment is stability over periods that correspond to offline align- 
ment cycles. Typical loads under which that stiffness needs to be achieved 
are (in order of relevant timescale): 


e Vibrations (external from ground vibrations or other experimental 
equipment, or internal, for example from coolant flow); 


e Thermo-mechanical loads, due to differential coefficients of thermal 
expansion and local and/or temporary variations of the temperature; 


e External, ‘seismic’ shocks, due to drastic changes of the environ- 
mental state of the experiment (cooling system shutdowns, planned 
or unplanned magnet ramps etc.); 


e Long-term deformations, like long-term dimensional changes 
caused by humidity variations, or relaxation processes in polymer 
components (plastics and glues). 


Usually, these loads are small or can be made small for the static ex- 
periments typical in particle physics, allowing for the aggressive optimisa- 
tion of support structures to reduce mass. 


10.4 MOMENTUM MEASUREMENT IN MAGNETIC FIELDS 


In a homogeneous magnetic field B the trajectory of a particle with charge 
ze becomes a helix with radius R and pitch angle 2. The momentum is 
then given from the Lorentz force 


Pt = pcosa = zeRB. 


In the case of three equidistant position measurements, the middle sta- 
tion provides a measurement of the sagitta with respect to the base line set 
up by the two outer stations. The sagitta of the track in the magnetic field 
relates to the momentum, 


@ RP P 
8 8R 8R- 


(0) zeBL? 
s=R—Rcos—=~R . 
2, 8p: 


If the sagitta measurement is used to determine the momentum, then the 
error on the momentum measurement is related to the error on the sagitta 
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The sagitta error is closely correlated with the position measurement error 
in each layer. The exact relation depends on the number and position of 
the point measurements. For the simple case discussed here with three 
equidistant stations s = x2 — (x; +x3)/2 and therefore, assuming the same 
position measurement error Ox in the three stations, 
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Currently at the LHC these periods 
range typically from Ô (24 h) for high 
level structures to a few times per year 
for small structures. 


An example for deformation due to 
differential thermal expansion is the 
ATLAS IBL [6], which shows signifi- 
cant detector movements as the cooling 
temperature is adjusted and as the lu- 
minosity varies during a fill of the LHC. 
Dedicated software procedures are re- 
quired to correct for these changes [83]. 


These relaxation processes are some- 
times referred to as ‘creep’. 


Often a dimensional version of this 
equation is used 


p;/GeV = 0.3z(R/m)(B/T). 


Figure 10.4: Sagitta geometry. 


L is sometimes called the ‘lever arm’ of 
the momentum measurement. 


For a large number (n > 10) of equidis- 
tant planar position measurements on a 
parabolic track [264] 


o1 [7 
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For more general expressions, including 
other geometries, see [420]. 
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It should be obvious that for a good momentum measurement a large 
tracking volume, a large magnetic field and good space point resolution 
are required. 

At low momenta, the effects of multiple scattering must also be con- 
sidered. If we stay with the simple system of three thin stations for the 
sagitta measurement with no scattering material between, then for the 
three stations 


L L 
Axı =0, Ax = 7 Ab, Ax3 = L'AO; + 7 Ab, 
where L’ = L/sin®, where © = 2/2 — À is the angle between the track 
and the magnetic field, and the A0; 2 are the deflection angles in the first 
and second station, respectively. Therefore 

L L L 

As = —7Ae and Os = gO? = Z %> 

where 4 is given by eq. (2.53). 
We simplify eq. (2.53) further (which, as discussed, already ignores 

the large-angle scattering tails), by dropping the logarithmic part to get 


L 13.6MeV, /L! 
Op, = () 4 Bep NV Xo 1 
ZEL = ~ o 
MS 


s zeBL? BB\/LXpsin® 


8psinO 


The combined transverse momentum resolution can thus be described by 


Op PtOx 1 
=C ! . 10.1 
p BL ~~ BB/LXoVsin® oe 


Hence, multiple scattering will be the dominant contribution at small mo- 
menta, while the position measurement error will dominate at high mo- 
menta and therefore the momentum resolution will increase linearly for 
high momenta. A good resolution requires large B and L, and good space 
point position resolution (and detector alignment) and small amounts of 
scattering material. 
For the resolution on total momentum, the error on the pitch angle A 
needs to be included [483], 
Op — Or o, tana. 
P Ppi 


(10.2) 


The fractional momentum resolution of the ATLAS inner detector [84] as 
a function of pr shows a constant value at low values of pr as expected 
from multiple scattering. At large values of pr the resolution is dominated 
by the error on the sagitta measurement and therefore the fractional error 
increases with p. 


10.5 SPECTROMETER MAGNETS 


In addition to the need for a high magnetic field and a large size to allow 
for a long lever arm, other considerations are relevant for the design of the 
spectrometer magnet: the field should be homogeneous, to simplify the 
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This estimation is only a justification 
for the functional form of the result. The 
challenge of all calculations of multiple 
scattering is the integration of the effect 
for thick slabs of material. An analytical 
calculation is difficult and the practical 
alternative is to obtain the multiple 
scattering distributions using Monte 
Carlo. 


For very low momenta energy loss 
fluctuations in detector material in front 
of the spectrometer can also be relevant. 


‘$’ denotes a quadratic addition of 
errors. 


Note that a large L usually implies a 
large magnet, and the stored energy 
of the magnet is proportional to the 

volume. 


Tracking 


& 0.25 AL 
cs E Split tracks t J 
S: AE -+ Data, full ID J 
a 22 -+ Data, Si only E 
C -*- Simulation, full ID J 
0.15 4 
0.1 N ' —A— a 
( ATLAS Preliminary Cosmic ’08 i— 
0.05 4 
ee ene IR 7 
oE p t yaar] È j riiiirl | 

es 
1 10 10 P; [GeV] 


track fitting and reduce non-uniformity of the response of the tracking de- 
tectors (for example due to changes of the Lorentz angle). The field lines 
have to loop back outside of the tracking volume. One way to do this in 

a controlled fashion is by providing a return yoke made from iron. The 
return yoke for central solenoid fields in collider experiments can be in- 
strumented with detectors for the tracking of muons, as the iron acts as a 
muon filter, stopping all other particles (except neutrinos). 

In particular in collider experiments the coil must fit into the available 
space, and material in magnet components crossed by tracks must be min- 
imised to limit multiple scattering. There is a fundamental design choice 
in collider experiments whether to place the electromagnetic calorime- 
ter (see chapter 11) within the coil to minimise the material in front of it 
with the penalty of constraining its depth, or outside. In the latter case, 
the deterioration of the energy resolution can be reduced by the use of a 
pre-shower detector in front of the calorimeter. To reduce the size of con- 
ductors and supply cables spectrometer magnets in collider experiments 
are usually superconducting. Finally, cost is a major factor limiting the 
size and strength of spectrometer magnets. 

Three magnet geometries are in general use: 


e Dipole: Large-volume dipoles are primarily used in fixed target ex- 
periments, where they provide a large unimpeded opening for the 
entry and exit of the close-to-parallel tracks in these experiments. 
The coils and return yoke can be placed outside of the detector ac- 
ceptance. 


In collider experiments dipole magnets have the disadvantage that 
the bending occurs only in one direction and their use there is only 
historic (for example in UA1 [75]), with the exception of forward- 
oriented experiments like LHCb [54]. 


e Solenoid: Solenoids are today the standard geometry for the central 
tracking systems in collider experiments. They provide high field 
uniformity and invariance of the magnetic field under rotations by 
the azimuth angle @. The drawbacks are that this geometry only 
provides a direct measurement of the transverse momentum pr, and 
low momentum tracks can be trapped in the field and cause high 
occupancies in the tracking detectors. 
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Figure 10.5: Relative momentum resolu- 
tion for the ATLAS Inner Detector (ID) 
as a function of pr [84] measured using 
cosmic rays. The resolution is estimated 
using cosmic rays that penetrate the full 
ID and comparing two estimates of p 

by splitting the track into two segments. 
The difference between the data and the 
simulation at large values of p is due to 
residual detector misalignment. 


In practice, the magnetic field will have 
significant non-uniformity. For example, 
a solenoid gives a very uniform field in 
the centre of the magnet, but a weaker 
and non-axial field near the edges of the 
solenoid. 


These are usually gaseous muon detec- 
tors. 


For a detailed discussion of supercon- 
ducting detector magnets see [368]. 


Figure 10.6: Dipole magnet of the 
LHCb experiment [332]. Note that 

in the drawing the rear aperture (up- 
stream) is smaller than the front to 
minimise the field volume in accordance 
with the acceptance of the experiment. 


The momentum resolution is thus poorer 
for tracks in the forward direction. 
However, this is also the direction where 
additional material increases multiple 
scattering. 
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The superconducting magnet with the largest stored energy cur- 
rently is the CMS solenoid with a stored energy of 2.6 GJ [285]. 

It has a free bore diameter of 6 m and a maximum magnetic field 
of 4.0 T that is achieved with a current of 19.5 kA. The return flux 
of the solenoid is through the iron in the hadronic calorimeter. The 
large bore diameter allows for the inner tracker together with the 
electromagnetic calorimeter to be contained inside the solenoid, 
thus minimising the ‘dead’ material in front of the electromagnetic 
calorimeter. 


An example of a smaller solenoid is that used by ATLAS [508], 
which has a magnetic field of 2.0 T using a current of 8.4 kA. The 
solenoid has an inner diameter of 2.3 m and a length of 5.3 m. The 
stored energy is 44 MJ. The inner tracker is inside the solenoid, but 
the electromagnetic calorimeter is outside. Therefore, minimising 
the thickness of the magnet (in units of radiation lengths), resulting 
in a somewhat lower field, has been critical in order not to degrade 
the resolution for electrons and photons. The impact of the passive 
material of the solenoid on the electromagnetic calorimeter is re- 
duced by the use of a pre-shower detector. 


Toroid: Toroidal magnet geometries again are in principle symmet- 
ric in the azimuth angle, and thus are attractive for collider experi- 
ments, and they have the benefit that they provide a direct measure- 
ment of the momentum. However, particles have to cross the inner 
part of the coil, and thus the most common application for toroid 
magnets is for muon spectrometers. Because of the size of such sys- 
tems, it is usually not practical to realise the field with one contin- 
uous coil, but the toroid field is approximated by several discrete 
coils, which introduces non-uniformities of the field. 


The largest toroid magnet system today is part of the ATLAS exper- 
iment, with an air-core toroid consisting of 8 coils in the barrel, and 
two endcaps with 8 coils each, surrounding the central solenoid. 


For all practical magnets the field will show significant non- 
uniformities over the detector volume. Therefore it is essential to accu- 
rately measure the magnetic field over the relevant volume. This can be 
done using Hall and NMR probes [39]. 


10.6 VERTEX RECONSTRUCTION AND MEASUREMENT 


Vertices can be classified in different categories: 


e A primary vertex is the point of collision of the particles in the two 


beams in collider experiments, or the position of the target particle 
in a fixed target experiment. In high-intensity environments more 
than one primary collision can take place close in time (e.g. within 
the same bunch-crossing at a collider). Primary vertices are con- 
strained by the size of the beams and, when applicable, the target. 


A secondary decay vertex is the point where an unstable particle de- 
cays into daughter particles. These can be short-lived particles like 


B or D mesons, or T, in which case the secondary vertex will usually 
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In CMS operation the magnetic field 
is 3.8 T (Superconducting magnets 
are usually operated slightly below 
the maximum current, to minimise 
the risk of a quench, a sudden loss of 
superconductivity). 


The solenoid represents passive mate- 
rial between the electromagnetic and 
hadronic calorimeter. The thickness is 
3.9 Xo. 


The solenoid represented a thickness of 
0.66 Xo at normal incidence [85]. 


Another feature of a toroidal field is 
that in the centre there is no field, which 
has been exploited in the CLAS fixed 
target experiment at CEBAF to allow 
for the use of dynamically polarised 
targets [365]. 
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Figure 10.7: ATLAS magnet sys- 
tems [508]. 


Even though the collisions involve 
quantum mechanical states with non- 
localised wavefunctions, on the scale 
of the detector resolution the concept 
of a well-localised particle collision is 
meaningful. 
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Figure 10.8: H — eeuu candidate event 
at a collision energy of 13 TeV [100]. 
The Higgs boson candidate is recon- 
structed in a beam crossing (beams 
from left and right) with 25 additionally 
reconstructed primary vertices from 
minimum bias interactions. Tracks 
above I GeV are shown. Red lines show 
the path of the two muons, green lines 
show the paths of the two electrons, 
and the yellow tracks are the remaining 
charged particles from the Higgs boson 
candidate vertex. Grey tracks originate 
from other vertices. 


be well before the first tracking station, or long-lived particles like K 
or A, which decay typically within the instrumented volume of the 
experiment. 


e A secondary interaction vertex is the point of the interaction of a 
particle created in the primary collision with material in the detec- 
tor. Examples for such interactions are bremsstrahlung, pair produc- 
tion, and inelastic hadronic interactions. These vertices need to be 
understood, as they otherwise can be a source of mis-measurements. 
The identification of conversion vertices is important for reducing 
the backgrounds to prompt electrons. Photons converting in the 
tracker material can be identified from the tracks of the ete~. The 
identification of bremsstrahlung is used for an improved determina- 


tion of electron energies. This is particularly important in the 
ATLAS and CMS detectors for which 
the total radiation length of the tracking 


detectors is O(1Xo). 
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23007, eR A a eee cL support structures and outside of these 
-300 -200 -100 0 100 200 300 is the first layer of the SCT. Vertices 


x [mm] inside the beam pipe are masked out. 


Vertex reconstruction makes use of reconstructed tracks. It typically 
consists of vertex finding, in which compatible tracks are assigned to a 
vertex candidate, and vertex fitting, where the position of the vertex is 
found, usually by a least square fit, or again by a Kalman filter. Additional A I P 
constraints, either due to the decay kinematics or the beam size can be in- ° ` 
cluded (typically as Lagrange multipliers). 
For short-lived particles the full reconstruction of the decay vertex is 
difficult and can result in loss of efficiency. Therefore, a parameter that 
is often used to determine secondary vertices is the impact parameter of 
a track, which is the distance of the primary vertex to the point on the 
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parameterised track that is closest to the primary vertex (sometimes called 
the perigee). In a cylindrical tracking detector geometry the impact pa- 
rameter can be split into a transverse impact parameter ô, and a longitu- 
dinal impact parameter 6,. We assume that the resolution due to multiple 
scattering is uncorrelated with the intrinsic detector resolution. It can be 
shown (for example in [228]) that the error on the impact parameters can 
be parameterised as 


05,=Ci® a, 
piv'sin® 
and 
05, = C3 Ci 


pV sin? © 

where Cı and C3 parameterise the resolution due to measurement errors in 
the detector and C2 and C4 parameterise multiple scattering. In complex 
detector systems the track is also extrapolated to match it with signals in 
other sub-detectors, most importantly the calorimeters, but also possible 
particle identification detectors or other tracking systems (in particular for 
muons). 
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See exercise 5. 


Tracking 
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Key lessons from this chapter 


Track fitting combines several space points to obtain overall kinematic properties of 
the track from a primary particle. In a magnetic field this includes the momentum, 
which can be determined from the curvature. 


Track reconstruction involves pattern recognition, track fitting and rejection of signals 
that do not belong to the track. 


The momentum resolution perpendicular to the magnetic field can be parameterised 
as 

(07 

-PT — Apr OB, 

PT 
where A depends on the space point resolution of the detector and B on the multiple 
Coulomb scattering (i.e. material budget). 


Secondary vertices can be reconstructed from tracks. Decay vertices of short-lived 
particles can be characterised by the impact parameter. The impact parameter resolu- 


tion can be parameterised as 
/ 


B 
05 =A' O—. 
PT 
Good knowledge of the positions of all the detector elements is required. For the case 
of large tracking detectors in collider experiments, the data can be determined from 
software alignment, provided that the detector is sufficiently stable. 


EXERCISES 


1. Determination of sign of charge for charged particle. 


Assume a tracker with a large number (N > 10) of space points with a point resolution Ox in a uni- 
form magnetic field. 


a) Neglecting multiple scattering, find the maximum momentum for which the sign of the parti- 
cle can be determined with a significance of 3 ø. (Hint: Find the relation between op-1/R™! 


b) 


and o;/s and then use the expression for O; for large n, to find Gp-1.) 


What is this momentum for a tracker with N = 12, a lever arm of L = 1.2 m, and a point res- 
olution of Ooy = 10 um, in a magnetic field of 2 T? What is this momentum for a tracker with 
a similar lever arm, but for N = 50, © = 100 um, and B = | T? Which detector technologies 


could be used to achieve these specifications? 


2. Momentum measurement in a dipole spectrometer. 


A dipole spectrometer consists of a dipole magnet between two tracking detectors of length D. The 
magnetic field is assumed to be uniform over a length L and zero outside. The momentum of an 
incoming particle can be inferred from the deflection angle @ between the straight particle track 
before and after the magnet. Before the magnet the angle of the track to the axis of the experiment, 
which is perpendicular to the magnetic field, is œ, and after the magnet it is B = œ + 0. You can 
assume that all angles are small. 
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Detector Magnet Detector 


D M > D 


a) Show that, if the tracking detectors consist of two tracking layers, separated by the distance 
D, and each with a point measurement error of 0, perpendicular to the magnetic field, the 
error on the slope m of the track measured by one tracking detector is given by 


0, 
On = V2.5. 


b) Use this to find the error on the measurement of a, B and hence on @. 


c) Show that p = 0.3BR, where p is the momentum transverse to the magnetic filed is units of 
GeV/c, B is the magnetic field in T, and R the radius of curvature of the track in the magnetic 
field in m. Find dp/d@, and then, using your result from b), the momentum resolution o,/p. 
For a spectrometer with o, = 100 um, D = 1 m, B= 1.5 T and L = 2 m, what is the momen- 
tum resolution for muons with a momentum of either 10 or 100 GeV/c? 


d) The best estimate for the slope of a straight track measured with N spacepoints with coordi- 
nates (x;,y;) can be obtained from a least-square fit and is given by 


_ Nyi) — Lx Lyi 
m= 
NEX - (Lx) 


where the sums run from 1 to N. Show that the uncertainty on the slope for equally spaced 
measurements from 0 to D, which all have an point measurement accuracy of oy is given by 


? 


o -9 12(N— 1) 
"DV N(N+1)' 


e) Hence show how the resolution of the dipole spectrometer improves when additional detec- 
tion layers are inserted into the upstream and downstream detectors (while maintaining the 
overall depth). What performance drawback that is not covered in these equation will this 
introduce? 


3. Impact parameter measurement. 


The figure shows a B meson produced at primary vertex V; decaying at vertex V2. The impact pa- 
rameter ô for a lepton from the decay is defined in the figure. 


lepton 


Tracking 205 
a) Explain the meaning of L and Y, and derive the relation 


5- cTpB sin 0* 
~ 1+ßcos0*’ 


where c is the magnitude of the velocity of the B meson in the laboratory frame, Tg is the 
proper lifetime of the B meson and 0* is the angle between the direction of the lepton and the 
B meson in the latter’s rest frame. 


b) For a typical B meson lifetime of 1.6 x 107!? s, @* = 2/2 and assuming that the B meson 
takes 80% of the b-quark energy, calculate 6. What quantities must be measured to determine 
6 and what detector components are used for this in a typical collider experiment? 


4. Impact parameter resolution. 


A simple vertexing system at a collider experiment consists of two concentric cylindrical layers 
with radii of Rj = 50 mm and R2 = 70 mm around the collision point, respectively. The point mea- 
surement resolution of this system in the r-@ direction is O, = 10 um. 


a) Show that, neglecting multiple scattering, the uncertainty of the impact parameter (the closest 
distance of the reconstructed track to the beamline in the plane perpendicular to the beam) is 


given by 
[R2 | R2 
RI +R; A 
Rea Ry rọ» 
and evaluate this for the parameters given. How does 05 change if (i) Ri is increased by 


10 mm, (ii) Rı and R2 are both increased by 10 mm? What limitations are their to decreas- 
ing R4, or increasing R2? 


05 = 


b) Assume that each layer has a thickness of 1% Xo. How does multiple scattering affect the im- 
pact parameter resolution? For what momentum would the uncertainty on the impact param- 
eter introduced by multiple scattering equal the uncertainty caused by the point measurement 
error? What other source of multiple scattering would need to be considered in a real experi- 
ment? 


5. Multiple scattering and impact parameter resolution. Consider a cylindrical collider detector with 
a radius of the first layer, R. The first layer has a thickness of s in units of radiation length. Assume 
that the trajectory is accurately reconstructed after the first layer. Consider a particle produced at 
the centre of the detector with a momentum p and polar angle @. Let œ be the multiple scattering 
angle at the first layer. 


a) How does the uncertainty in œ, o(a@) scale with p and 0? 


b) The uncertainty on the transverse impact parameter is o (dọ) = o(a)L, where L is the dis- 
tance from the vertex to the detection point in the first silicon layer. How does © (dọ) scale 
with s, R and 0? 


6. Tag-and-probe method to determine efficiencies from data. 


Consider a sample of muons from single muons triggers which is reconstructed in both the track- 
ing and muon detectors. We can then search for an additional muon in the event using muons 
found using only the tracking detector and for for each pair we calculate the invariant mass. Cuts 
are then made on Muy to obtain a sample of Z —> u` u” events. We can then use the triggered 
muon as the ‘tag’ and the other muon found in the tracking detector as the ‘probe’ to measure the 
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efficiency of identifying a muon in the muon detector (the procedure can be reversed to find the ef- 
ficiency of identifying a muon in the tracking detector). Let the number of probes be Nprobes and let 
the number that pass the muon identification be Npass. 


a) Assume that all muon pairs come from Z — u+ u~ decays. What is the average muon identi- 
fication efficiency? What is the statistical uncertainty? 


b) How can this method be extended to measure the efficiency as a function of muon pr or n? 


c) How can this method be made to work for a real sample of muon pairs which originate from 
an unknown mixture of Z — u* u~ and background events? 


d) In a sample of Z — tL events, in which no charge selection has been made, there are Nss 
same-sign and Nos opposite sign events. Ignore background contributions, what is the prob- 
ability to wrongly measure the charge sign of a muon? Why is the probability of wrongly 
measuring the charge of electrons generally much larger than that for muons? 


1 Calorimetry 


In the previous chapters we have seen how the quasi-continuous small 
energy loss of charged particles enables the detection of these particles 

in tracking detectors, typically made of thin detector layers. A different 
approach is taken in calorimeters, where the particle ultimately is fully 
absorbed. If the particle has a high energy it is lost in a cascade of sec- 
ondary interactions until all particles in the cascade have come to a stop. 
Despite the name, calorimeters in particle physics usually do not measure 
the heat generated in the detector material, as this is usually too small to 
be detected, except at extremely low temperatures, and the typical signal is 
either ionisation charge or scintillation light. 

We will start this chapter by looking at the energy measurement of 
y rays from nuclear decays, as this does not involve the more complex 
effects in electromagnetic cascades that are instigated by high-energy in- 
coming particles. We will see that the resolution of this measurement is 
defined by the counting statistics of secondary quanta in the detector. One 
way to improve this measurement therefore is to rely on quanta that re- 
quire little energy for creation and are thus produced in abundance. Thus 
detection of lattice vibrations (phonons) allows for very good energy reso- 
lution at low energies, but they can only be detected at ultra-cold tempera- 
tures of a few mK. 

For incoming particles with high energies, the secondaries created by 
the interaction with the detector matter have enough energy to create fur- 
ther particles with sufficient energy to create a cascade of particles with 
progressively lower energy. For incoming particles with higher energy the 
number of secondary particles in the cascade will be larger and therefore 
the relative energy resolution improves with energy. We will first discuss 
cascades caused by electromagnetic processes, and consider some exam- 
ples of homogeneous and sampling electromagnetic calorimeters. We will 
then discuss the more complex cascades induced by hadrons and the im- 
plications of this complexity for hadronic calorimeters. We will finally 
look at examples of hadronic calorimeters and the different approaches to 
optimising the resolution. 

As the processes causing the cascades apply for neutral as well as 
charged particles, calorimeters are usually the only type of detector that 
can detect stable neutral particles, and measure their kinematic properties 
(i.e. their energy and direction). 


11.1 GAMMA SPECTROSCOPY 


Photons from nuclear transitions have a well-defined energy between a 
few keV and about 10 MeV. As discussed in section 2.1, these photons will 
interact with the sensor material in a detector, depending on their energy, 
dominantly by photo-electric effect, Compton scattering or pair produc- 
tion. Each of these processes will create free electrons (and positrons) that 
will deposit the energy they accrued from the original interaction of the 
gamma ray, by ionisation and excitation of the sensor material. 


DOI: 10.1201/9781003287674-11 


Historically, in early nuclear physics, 
indeed the heat created in radioactive 
decays was measured (see for exam- 
ple [393 ]). However, this was done for 
the radiation from sources compris- 

ing a large number of particles. The 
name was then applied to detectors that 
detected neutral particles through a cas- 
cade of interactions as described later 
in this chapter. The refinement for par- 
ticle physics is then that the particle’s 
energy is converted into a proportional 
signal. This is the meaning of the term 
in particle physics today. 


We will see that the statistical fluctu- 
ations are actually often smaller than 
expected from pure Poisson fluctuations, 
due to correlations in the creation of the 
secondary quanta. 


One special case of a neutral particle 
that can be reconstructed in a tracking 
detector is a photon that undergoes 
pair conversion in the material of 

the tracking detector, which allows 
reconstruction of the tracks of the e+ e~ 
pair and hence the measurement of the 
y energy and direction. 
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The active detector volumes for the detectors for low energy ys are 
usually too small to guarantee full containment so that, depending on the 
interaction mechanism, not necessarily all the energy of the original pho- 
ton will be deposited in the detector, and the measured spectrum contains 
several characteristic features: 


e Photopeaks are full energy peaks, where the photon transfers all its 
energy to an electron in the detector material by the photoelectric 
effect. The range of the electron is typically smaller than the size of 
the detector, and thus all the energy of the incoming y is collected. 


e The Compton edge is the upper endpoint of a continuum of de- 
posited energies from electrons released by Compton scattering. 
The Compton edge energy is given by eq. (2.11). While the electron 
from the Compton scattering has a short range and usually deposits 
all its energy in the detector, the scattered photon often does not 
interact again in the detector, and this results in the continuum of 
recorded energies. 


e If the photon interacts by pair-production the resulting electron- 
positron pair will share the energy available after their creation, 
Ey- 2mec°. Both will lose their energies in the detector material by 
ionisation and excitation until they come to a stop. The positron will 
then annihilate with an atomic electron, creating a pair of photons, 
each with an energy of mec? = 511 keV. The probability for these 
photons to interact in the detector is limited, causing single-escape 
and double-escape peaks that are 511 keV and 1022 keV below the 
photopeak, respectively. 


In addition, there can be features that originate in the material surrounding 
the sensor, typically from photons generated in the surrounding material: 


e There can be a symmetric reverse Compton edge, which is caused 
by photons that have been backscattered into the sensor from sur- 
rounding material, and then deposited their remaining energy by the 
photoelectric effect. 


e When the photons undergo a photoelectric effect in surrounding 
materials, the emitted electron leaves a vacancy that gets filled 
by another electron with the emission of an X-ray with an energy 
characteristic for the surrounding material. This can be captured by 
the sensor, resulting in X-ray peaks with energies of a few to a few 
ten keV. 


e The photon can also interact by pair-production in the surrounding 
material, in which case a 571 keV photon from the positron annihila- 
tion can be backscattered and detected in the sensor. 


The finite size and the material affect the efficiency of the detector, 
which also depends on the photon energy. 

Common gamma spectrometers are either based on scintillation in 
crystals like Nal(Tl), BGO, or LaBr3:Ce, read out by photomultiplier 
tubes (PMTs) or silicon photomultipliers (SiPMs), or they can be based 
on high-Z semiconductors like Ge, CdTe, or CZT. Scintillation detectors 
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For high-energy calorimeters as dis- 
cussed later, these features will not be 
visible. 


Due to the finite energy resolution of the 
specific detector some of these features 
may not be visible in the measured 
spectrum. 


In solids the range of a 1 MeV electron 
is typically less than 1 mm. 


In nuclear spectroscopy sometimes an 
‘anti-Compton shield’ or ‘Compton- 
suppression shield’ is used, which is 

a thick scintillator surrounding the 
gamma detector, which vetoes events in 
which a photon gets scattered out of the 
detector into the scintillator. 


Again, in addition to the geometry and 
type of material around the sensor, 

the height and width of these features, 
and whether they are visible at all, 
will depend on the size, efficiency and 
resolution of the detector. 


For germanium, for example, the attenu- 
ation length varies from about 30 um at 
10 keV to about 7 mm at 10 MeV. 


See section 5.3. 
See sections 4.4 and 4.6. 


See section 9.13. 


Calorimetry 


can easily be produced in large size, but have a somewhat poorer energy 
resolution because of the large energy required to produce a scintilla- 

tion photon. Germanium can also be produced in large, pure crystals, but 
has a small band gap, which requires cooling by Liquid Nitrogen (LN2), 
whereas the type II-V semiconductors have a larger band gap, but are dif- 
ficult to produce in larger size. 
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The best resolution for scintillators is 
achieved for LaBr3:Ce. 


A review of the use of Ge detectors 
for gamma spectroscopy can be found 
in [229]. 
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Figure 11.1: Gamma spectra from © Co. Left: Spectrum from a LaBr3:Ce scintillator [412], Right: Spectrum from a Ge detector [312]. ‘SE’ and 
‘DE’ are the single and double escape peaks, respectively, for the 1332 keV peak. “°K’ is a background peak from naturally occurring radioactive 
49K, ‘backscatter’ corresponds to the reverse Compton edge. Data above 1400 keV are from double-event pile-up due to a finite integration time of the 


sensor readout. 


The energy recorded in the detector material is the sum of many in- 
dividual energy transfers. However, usually only a fraction of the energy 
will end up as detectable signal, with the rest ultimately ending up in heat. 
As the energy transfer is quantised by the finite energy required to create 
an electron-ion or electron-hole pair or a scintillation photon, there will 
be a finite, integer number of energy transfers, the exact number of which 
will vary from event to event. A simple estimate of the energy resolution 
can therefore be obtained from Poisson counting statistics. 

The best energy resolution is achieved with cooled Ge detectors, 

0.1% for photons of G(1 MeV). This is smaller than the resolution ex- 
pected if the pulseheight would be Poisson-distributed. The improved res- 
olution is due to the fact that the individual electron-hole pair creations are 
not uncorrelated. In practice, the reduction can be parameterised using the 
Fano factor F (see section 2.6), as 


OF On FAg 
FP ae P= 
VF E / Poisson VE, E 


with 0 < F < 1, and Ag the energy required to create a secondary quan- 
tum. The value of the Fano factor can be predicted from Monte Carlo sim- 
ulations. For semiconductors it is found to be about 0.1. 

In scintillators, the energy used to create the signal, scintillation light, 
is a small fraction of the energy deposited by the photon, hence the Fano 
factor is close to 1. 


(11.1) 


It takes about 2.9 eV to create an 
electron-hole pair in Ge. So, for 

a 1 MeV photon, N ~ 10° /2.9 

and from Poisson statistics 

ov /N =1//N ~1.7x 10°. 

For the energy resolution 

o¢/E = on/N, not including any 
other sources of experimental errors. 
For these see for example [394]. 


Fluctuations in the internal amplifica- 
tion process in a PMT, as is often used 
to detect the scintillation photons, will 
degrade the resolution even further. 
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11.2 SUB-KELVIN CRYOGENIC DETECTORS 


As noted above, in the common detectors collecting ionisation charge 
or scintillation light a significant fraction of the energy of the incoming 
particle will be lost to heat, and the fluctuations in the partition of the en- 
ergy drive their energy resolution. 

At a microscopic level the heat manifests itself as lattice vibrations, 
or phonons. At room temperature phonons from particle interactions in 
the detector material cannot be detected above the background of ther- 
mally created phonons. However, at very low temperatures (typically well 
below 1 K) this background is sufficiently depressed to allow for their ob- 
servation. In addition, the low energies required for the creation of these 
quasi-particles result in low detection thresholds, and the operation at low 
temperatures reduces electronics noise. 

A signature of the heat deposited in the detector is an increase of the 
temperature due to the heat capacity of the material. At low temperatures 
the heat capacity of an insulator can be described by the Debye model, 


_, lantR ( T \? 

A Gr 

with the Debye temperature Tp and 12”*R/5 ~ 1944 JK~!mol"!. 
The small heat capacity at very low temperatures makes the heat from 
particle interactions detectable. For example, an energy deposition of 

10 keV increases the temperature of 1 kg of silicon at 10 mK by 6 uK. 

The typical arrangement to measure heat is a bolometer, consisting of 
a block of detector material with finite heat capacity C that is connected 
through a thermal impedance Rj, with a heat sink at constant tempera- 
ture. A deposition of the energy Q results in a temperature spike with 
AT = Q/C, which decays with a time constant given by T = RC. The 
lifetime of thermal phonons in the detector is given by this time. The time 
required for the phonons to thermalise and the time required to restore the 
temperature limits the counting rate of thermal calorimeters to a few Hz. 

Fluctuations in the total number of phonons in the absorber have a 
variance C/kg, which yields a fundamental limit of the energy resolution 
of (E 2) = kgT?C, which is independent of the energy and proportional to 
the heat capacity of the absorber [361]. An absorbed heat P will raise the 
temperature by AT = PR. Fluctuations in the power flowing to the sink 
have a spectral density dp*/df = 4kgT? /Rm. Larger thermal impedance 
leads to lower noise power, but also to longer response time and reduced 
heat removal capability. 

The most common phonon sensors to measure the temperature in- 
crease are resistive thermometers, for which the resistance changes as a 
function of temperature. Examples are semiconducting thermistors or su- 
perconducting transition edge sensors (TES). 

A thermistor is a heavily doped semiconductor slightly below the 
metal-insulator transition. Good doping homogeneity and reproducibil- 
ity is required, and can be achieved with Neutron Transmutation Doped 
(NTD) devices. Germanium with NTD (Ge-NTD) is a commonly used 
sensor material. The typical resistance at the working point is between 
1 and 100 MQ. 


(11.2) 
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For a more detailed summary of cryo- 
genic detectors, see [505]. 


Such detectors are also called ‘non- 
equilibrium’ detectors, as the signal is 
acquired before the deposited energy 
is transferred to other, undetectable, 
channels (e.g. heat). 


Phonons can be interpreted as quasi- 
particles with energies down to meV 
(thermal phonons). Other low-energy 
quasi-particle states that can be used 
for particle detection are for example 
superconducting Cooper-pairs. 


In a metal there is a contribution from 
the electron heat capacity that is pro- 
portional to T, which dominates at 
temperatures below I K. 


Table 11.1: Debye temperature in 
common cryogenic detector materials. 


Material Tp [K] 
Silicon 645 
Germanium 374 
Nal 165 
CaWO4 355 
LiF 735 
Al203 1042 


The classical bolometer is used for the 
measurement of continuous (infrared) 
radiation. In this section, we are con- 
sidering energy depositions by discrete 
particle passages. It would therefore be 
more correct to call this a ‘calorimeter’, 
but the designation ‘bolometer’ has 
stuck. 


Strictly speaking, temperature is an 
equilibrium concept and is not appli- 
cable until complete thermalisation of 
the phonons. However, the principles 
outlined in this section still apply in the 
near-equilibrium case. 


These expressions constitute the limits 
of resolution for detectors that have 
reached thermal equilibrium. If there is 
insufficient time to allow for complete 
thermalisation, then eq. (11.1) applies. 


The resistance of a semiconductor 
thermistor at a temperature Ty can be 
described by R(T) = Roexp(\/To/T), 
where Ro is the resistance at Ty. 


NTDs can be produced by thermal 
neutron irradiation of Ge sensors to 
produce dopants like Ga, As and Se. 
This method can result in very uniform 
doping [360]. 
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A TES consists of a superconducting layer that is operated at a tem- 
perature close to the transition between normal and superconductivity. 
This transition occurs over a short temperature range, which makes the 
device highly sensitive to temperature changes. The temperature range of 
the transition is small and thus the sensor has to be operated at a very spe- 
cific temperature. A simple readout scheme uses the change in resistance 
to steer the current in an inductance, causing a very small change in the 
magnetic flux that can be measured by a superconducting quantum inter- 
ference device (SQUID). 
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The sensitivity of a temperature sensor can be described by the loga- 
rithmic temperature sensitivity & = dlogR/dlogT = (T/R)dR/dT. For 
semiconductor thermistors a@ is typically around 5 or 6, and for TESs it is 
two orders of magnitude higher. In a TES the energy required to break up 
a Cooper pair in the superconductor is of the order of meV, and the Fano 
factor typically about 0.2, and thus sub-K calorimeters with TES readout 
can achieve the best calorimetric low-energy resolutions, down to eV for a 
signal of several keV. 

This resolution is often difficult to achieve in practice, due to imper- 
fections of the device material. In particle physics, sub-K detectors have 
been employed for precision measurements of the neutrino mass, and the 
search for neutrinoless double beta decay or weakly interacting massive 
particles. However, scaling up the detector technology to the large active 
detector volumes required for these searches is challenging, and today 
multi-ton noble liquid detectors are a strong contender in this field (see 
section 8.1). 


11.3 ELECTROMAGNETIC CASCADES 


In particle physics photon energies are often significantly higher than 

in nuclear decays, and the electrons (and positrons) created in pair- 
production dominant at these photon energies will have an energy above 
the critical energy, above which radiative energy loss will dominate over 
ionisation and excitation energy loss. The result is a cascade with an in- 
creasing number of electrons, positrons and photons, with decreasing en- 
ergy as the shower develops. At the beginning, the shower will be domi- 
nated by the sequence of pair production and bremsstrahlung, but as the 
shower progresses, the secondaries will become softer and the probabil- 
ity for the creation of lower energy photons increases. For photon ener- 
gies below a few MeV pair production stops, and the number of positrons 
ceases to increase, while more free electrons continue to be produced in 
Compton and photoelectric scattering events. At the same time, once the 
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Table 11.2: Transition temperatures for 
common superconductors in TES. 


Material T, [K] 
Tungsten 0.012 
Iridium 0.14 
Titanium 0.39 
Molybdenum 0.92 
Aluminium 1.14 


The readout uses a feedback circuit 
which actively adjusts the temperature 
using Joule heating. 


Figure 11.2: Principle of a TES 

(after [410]). Left: Resistance as a func- 
tion of temperature in a superconductor 
close to the transition temperature. 
Right: Simple readout circuit. Ry is a 
load resistor with fixed value. 


Bolometric detectors are in wide use 
in astrophysics, for example in detec- 
tors to study the cosmic microwave 
background. 


See section 2.1. 


The bremsstrahlung photon energy 
spectrum is shown in Figure 2.34. 


Typically, about three quarters of the 
charged particles in the electromagnetic 
cascade are electrons, most of them with 
low energies (<1 MeV). 
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energies of electrons and positrons drop below the critical energy, creation 
of new photons by bremsstrahlung becomes insignificant. At this point the 
shower will cease to grow, although the particles in the shower will con- 
tinue to lose their energy to ionisation and excitation, until all the energy is 
absorbed. 

The signal in the calorimeter is generated by the ionisation or excita- 
tion energy loss of the charged particles in the shower. The overwhelming 
majority of the charged particles depositing this signal are soft (for mate- 
rials with large Z on average 40% of the energy is deposited by particles 
with an energy below | MeV [497]), and their range in the calorimeter ma- 
terial is short. 

A minor role is played by photonuclear interactions [117]. The cross- 
section for such interactions peaks at a few 10 MeV due to the ‘giant 
dipole resonance’, but is still typically only about 107? of the cross- 
section for pair production. A consequence of these reactions is the pres- 
ence of neutrons from nuclear de-excitation in the cascade (see also Fig- 
ure 11.13). 

Similar electromagnetic cascades will be instigated by high-energy 
electrons or positrons hitting a sufficiently deep block of matter. Muons 
will only cause electromagnetic showers at much higher energies, be- 
cause of the lower rate of bremsstrahlung due to their larger mass. Simi- 
larly, charged hadrons will not directly initiate electromagnetic showers 
by bremsstrahlung, but they will will start a hadronic cascade (which will 
include an electromagnetic component) as discussed in section 11.9. 


11.4 ELECTROMAGNETIC SHOWER DIMENSIONS 


As discussed in section 2.1, the dominating interactions for high- 
energy particles in the initial stages of the cascade, pair production and 
bremsstrahlung, do have a common characteristic length scale, the radi- 
ation length Xo. It thus makes sense to express the longitudinal develop- 
ment of the shower in this unit. 

Because the number of charged secondaries that are generated in high- 
energy electromagnetic interactions per Xo is small (~1 to 2), a signif- 
icant number of Xp are required to contain the shower. For example, in 
lead 25 Xo are required to contain 99% of a shower of 100 GeV. An in- 
coming high-energy photon will on average undergo a pair-production 
event after 9/7 Xo, which will divide its energy between the electron and 
the positron. Hence, for a photon of twice the energy, the depth of the 
shower will increase linearly by such a distance. Thus, the depth of a 
high-energy shower scales logarithmically with the incoming energy. To 
achieve shower containment in an experiment the size of the calorimeter 
needs to scale logarithmically with the scale of the energy of the experi- 
ment. Thus calorimeters are powerful detectors in particular for high en- 
ergy experiments. 

The scaling of the shower depth with the radiation length is not per- 
fect. In materials with higher Z the critical energy is smaller. Hence, the 
increase in the number of shower particles due to pair production extends 
to lower energies, and thus to later stages in the shower development. In 
these materials the shower maximum will occur later, and the shower tail 
is longer. 
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Tonisation in this section is understood 
to include Cherenkov radiation in 

the material, if the conditions for its 
emission are met. 


In solids the range of a 1 MeV electron 
is less than 1 mm. 


The giant dipole resonance (GDR) is 

a high-frequency collective excitation 
of atomic nuclei, typically a coherent 
movement of all neutrons against all 
protons. It can decay by nuclear fission, 
or emission of neutrons or gamma rays. 


At extremely high energies (>10°° eV), 
the cross-section for e+ e~ pair- 
production decreases due to the 
Landau-Pomeranchuk-Migdal (LPM) 
effect [60, 311], whilst the cross-section 
for photonuclear interactions increases, 
so that cascades at these energies are 
predominantly hadronic. 


A simple model for the initial stages of 
an electromagnetic cascade is discussed 
in exercise l. 


For comparison, to maintain the mo- 
mentum resolution in a spectrometer the 
size of the spectrometer needs to scale 
with L?, where L is the lever arm of the 
spectrometer (see section 10.4). 


While 25 Xo are needed for 99% con- 
tainment in lead, in iron 21 Xo and in 
aluminium 18 Xo are sufficient. 
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Laterally, in the early stages of the shower the expansion of the shower 
is driven by multiple scattering of the charged high-energy secondaries. 
The scale length for this is called the Moliére radius, which can be found 
from integration of eq. (2.53), 


R 21.2 MeV 
MS y 


Several versions of this equation using approximate expressions for Ee 
and Xo are in use. Typically, 90% of the deposited energy of a shower is 
contained in a cylinder with a radius of one Ry. 

In the later stages of the shower development the low-energy photons, 
which have a significant probability to be Compton-scattered by larger 
angles but still have a significant range, cause a broadening of the tail of 
the shower. 


Xo. (11.3) 


11.5 ENERGY RESOLUTION OF HOMOGENEOUS 
ELECTROMAGNETIC CALORIMETERS 


In homogeneous calorimeters the whole volume is actively contributing 
to the signal from the detector. Typically, homogeneous calorimeters are 
made from transparent crystals that scintillate or produce Cherenkov ra- 
diation, or from liquid noble gases where the ionisation electrons are col- 
lected in a homogeneous electric field. The readout elements for homo- 
geneous calorimeters (PMTs or APDs for photon readout, or electrodes 
and electronic readout for ionisation chambers) are usually outside of the 
acceptance or behind the calorimeter. 

Similarly as in section 11.1, in high-energy calorimeters the signal, 
which is proportional to the energy of the incoming particle, is the sum 
of a large number N of more or less independent small energy transfers 
(to create ionisation charges or scintillation photons, etc.). Using similar 
arguments, we again expect that stochastic fluctuations of the measured 
energy can be described by 


oœ AN yN 1 1 
E N N W VvE 


1/2 


and indeed a stochastic term ~E~ /4 is usually a major contribution to the 
energy resolution of a calorimeter. Other contributions to the stochastic 
term come from particles in the cascade escaping detection, because they 
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Figure 11.3: Electromagnetic shower 
dimensions calculated using Monte 
Carlo simulation [496]. Left: Longi- 
tudinal profiles of 10 GeVe~ showers 
developing in aluminium (Z = 13), 
iron (Z = 26) and lead (Z = 82). 
Right: Radial distributions of the 
energy deposited by 10 GeV electron 
showers in copper at various depths. 
Monte Carlo techniques to simulate 
electromagnetic showers will be dis- 
cussed in section 11.12. 


Note that the radius of containment is 
thus independent of the energy of the 
incoming particle. The Moliére radius 
is also only weakly dependent on Z, 
as both radiation length and critical 
energy scale approximately with Z~'. 


Thin electrodes for ionisation chamber 
readout can sometimes be inside the 
active volume, resulting in a quasi- 
homogeneous geometry (see for exam- 
ple [244]). 


For the exact proportionality factor 
again statistical constraints will need 
to be considered in the form of a Fano 
factor, while additional stochastic 
contributions, for example from the 
readout, will degrade the resolution. 


214 


traverse sections without instrumentation or because the shower is not 
completely contained. Typically, in homogeneous calorimeters a stochastic 
term of a few %/,/E /GeV is achieved. 

More generally, the energy resolution of a calorimeter can be parame- 
terised as 

OE A B 

sE 

E JVE E 
The first term contains the stochastic contribution, and is often (but not 
always) the most relevant. The second term, which corresponds to an 
energy-independent error typically is caused by readout noise. The final 
term is associated with inhomogeneities in materials, calibration errors 
and some forms of radiation damage. 

An example for a large homogeneous electromagnetic calorimeter sys- 
tem is the CMS calorimeter [196], which uses PoWO; scintillating crys- 
tals with APD readout. The properties of PoWO; as a scintillator have 
been discussed in section 5.3. This material was chosen for the homoge- 
neous electromagnetic calorimeter in CMS because of its short radiation 
length, its high light yield (LY), the high speed, and good uniformity and 
radiation tolerance. 

The design energy resolution in the barrel part of this detector using 
the parameterisation in eq. (11.4) is A/VE = 2.7%/,/E/GeV , which 
is dominated by photostatistics due to fluctuations in the amplification 
in the APD (2.3% /,/E/GeV ), B = 210 MeV at high luminosity opera- 
tion, which is dominated by electronics noise (150 MeV), and C = 0.55%, 
which is dominated by calibration effects (0.4%) [199]. The performance 
was studied with prototypes in testbeams, where stochastic, noise and con- 
stant terms of A/VE = 2.8%/,/E/GeV , B = 127 MeV and C = 0.3% 
were observed [22]. However, in LHC operation there are additional ef- 
fects which will degrade the resolution; ‘pile-up’ (multiple pp collisions 
in the same bunch crossing) and radiation damage. In addition it is more 
difficult to achieve a small value of the constant term in a large calorimeter 
than in a small calorimeter used in a testbeam. 

The main effect of radiation damage in PoWO; is from ionising ra- 
diation [273]. When PoWO; crystals are exposed to ionising radiation 
pre-existing point-structure defects may act as traps for electrons or holes. 
These defects act as colour centres that absorb light strongly over some 
range of wavelengths, which reduces the attenuation length for the scintil- 
lation light. The light absorption coefficient grows linearly with the dose 
rate. The attenuation results in an increase of the constant term, which 
becomes the dominant contribution to the energy resolution (10% after 
irradiation by 1.3 x 10!4 p/cm?) [19]. 

The effect of neutron damage was studied using nuclear reactors, but 
the observed damage was attributed to ys, not neutrons. Fast hadrons gen- 
erate locally extended defects in the crystal. This results in a characteristic 
variation of absorption with wavelength, u œ A~*. There was no observed 
effect on the production of scintillation light. 

As the radiation damage only effects the light transmission, a very 
strong correlation between measured light yield and absorption was ob- 
served [273]. Therefore the effects of radiation damage can be monitored 
and corrected for by in-situ measurements of the absorption length. This 
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It is common to express the coefficients 
A, B and C in this expression in percent, 
assuming that the energy is given in 
GeV. 


We keep the designation ‘stochastic’ 
for the term proportional to 1/\/E even 
though the source for the other terms 
can also be stochastic. 


For APDs see section 9.9. 


The LY needs to be sufficiently high 
so that the stochastic term in the 
energy resolution is small enough 

to obtain very good resolution for 
high energy photons and electrons 

(E >~10 GeV). The LY for PhWO, is 
more than 100 photons/MeV. 
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Figure 11.4: Design energy resolution 

for the CMS electromagnetic calorime- 


ter [199]. 


The effect of radiation damage on the 
attenuation length of PhW O; crystals 
can be mitigated by specified impurities 
such as La, Y, and Nb [273]. 


This is the expected fluence for the CMS 
endcap calorimeter after an integrated 
luminosity of 3000 fb~'. This corre- 
sponds to the expectations for HL-LHC 
and is an order of magnitude larger 
than that expected for LHC phase 1. 
Therefore for the phase 2 upgrade 

this calorimeter will be replaced by a 
lead/silicon calorimeter [200]. 
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was achieved using lasers with a wavelength of 420 nm (the peak of the 
scintillation spectrum) to inject light into each crystal [196]. The resulting 
signal was measured using the front-end electronics, and the laser power 
using pn silicon photodiodes. The stability of this system was also con- 
trolled by using a laser at A = 796 nm (no significant change in attenuation 
is expected at this wavelength). 

Homogeneous electromagnetic calorimeters have also been made us- 
ing lead glass. For this type of calorimeter, the signal comes from the 
Cherenkov light generated in the glass by the charged secondaries in the 
shower, which can be detected by PMTs or other sensitive photon detec- 
tors. The light yield from Cherenkov radiation is much lower than that 
from scintillation in inorganic scintillators. Hence, the electron energy res- 
olution is typically inferior to that of homogeneous crystal calorimeters 
that are relying on scintillation. However, lead glass has been chosen for 
homogeneous calorimeters for its low cost and ease of handling and manu- 
facture compared to crystal scintillator materials. 

A large lead glass calorimeter was employed in the OPAL experi- 
ment, where the energy resolution for testbeam prototypes of the barrel 
calorimeter without material in front was og /E = 6.3% /\/E/G&V +0.2% 
between 5 and 100 GeV [392]. A more recent example is the lead glass 
component of the PrimEx calorimeter, for which an energy resolution of 


2 4 
op JE = 23% p34% 


E/GN ~ JE JGN 


was measured in the energy range from 1 to 5 GeV [322]. 

In addition to the energy measurements calorimeters sometimes also 
need to provide information about the position of the shower (for track 
shower matching, or to reconstruct the 4-momentum of a neutral parti- 
cle). In general, the position resolution has a similar energy dependence 
as the energy resolution, and in particular is often dominated by a 1/ VE 
term caused by stochastic fluctuations. The scale of the position resolu- 
tion is given by the Molière radius Ry or the cell size of the calorimeter, 
whichever is larger. 

Finally, for some calorimeters also the time of the shower signal can 
be important, if it is needed to correlate the shower with signals in other 
parts of the experiment. The time resolution in that case can usually also 
be described by a similar functional form as in eq. (11.4). 

It should be noted, that even though resolution, in particular for en- 
ergy, is used as a benchmark number for calorimeters, other considerations 
also influence the choice of design, like the energy range of the particles 
to be detected, space, the amount of material in front of the calorimeter, 
complexity of readout, ease of construction and/operation, and cost. Res- 
olution numbers are often obtained in dedicated test beams of prototypes 
that are often performed under highly idealised conditions, and the per- 
formance of the calorimeter in the actual experiment might differ signifi- 
cantly. It is also common practice to quote for the resolution the standard 
deviation of a Gaussian fit to the measured energy distributions, thus ig- 
noring any non-Gaussian tails. 
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Ordinary glass has a refractive index of 
n~ 1.5. The addition of lead results in 
an index of refraction of up to ~1.7. 


We have seen in 2.8 that Cherenkov ra- 
diation constitutes only a small fraction 
of the charged particle energy loss. 


Hence the cell size in electromagnetic 
calorimeter is often chosen to be similar 
to Ry. 


The idealised conditions in a testbeam 
often allow to make geometrical selec- 
tions, or to use calibration procedures 
that might not be available in the real 
experimental conditions. 
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11.6 SAMPLING CALORIMETERS 


To minimise the depth of the material required to contain a shower, 
thick layers of a dense solid (usually a metal) are desirable as absorber. 
However, in general these dense absorbers do not provide a measurable 
signal as dense materials are usually opaque and conductive or impure 
insulators. Hence, a common approach is to intersperse layers of dense 
absorber with less dense active detector layers. This is called a “sampling 
calorimeter’. Because of the size of the required instrumentation, the typ- 
ical readout technologies are scintillators in planes or fibres, ionisation 
gaps filled with liquid noble gases, or more recently, semiconductors. 
Semiconductors in the past have been considered prohibitively expensive 
for the use in calorimeters, but as will be discussed in section 11.13 there 
are benefits from the fine segmentation such readout can provide, which 
leads to their consideration in modern detector systems. Wire chambers 
are less useful, because of the small amount of active material, resulting in 
significant Landau fluctuations, and additional fluctuations from the inter- 
nal avalanche amplification process. 

The price to pay for the compactness are additional fluctuations of the 
energy deposited in the detector layers, which increase the stochastic term 
in the energy resolution (these are called ‘sampling fluctuations’). As dis- 
cussed, in an electromagnetic calorimeter most of the energy is deposited 
by low energy charged secondaries. Because of the low energy these have 
a range of less than 1 mm and their directions are random. They are typ- 
ically produced by low-energy photons and, in particular in high-Z ab- 
sorbers, these photons will also have a very limited range due to the strong 
dependence of the absorption cross-section on Z. Consequently, most of 
the detected signal in the sampling calorimeter will be generated within 
a thin layer at the absorber/detector interfaces in the detector gap of the 
sampling calorimeter. The resolution will still be dominated by Poissonian 
fluctuations of the number of deposited energy quanta, but the number of 
these quanta is now limited by this geometrical acceptance reduction. 

Several fit functions to predict the energy resolution of electromag- 
netic sampling calorimeters have been proposed, however, none of these 
are completely satisfactory!. Qualitatively, the following points emerge: 


1A parameterisation that is common in the literature is og /E = ,/AE/E, where AE is 
the energy loss in one absorber/detector layer (often the equation is written for E in GeV and 
AE in MeV with a corresponding scale factor of 3.2%). This equation is flawed, because it 
relies on Rossi’s ‘approximation B’ [432] (see exercise 1), without taking into account the 
energy scale of energy deposition in the electromagnetic cascade and the range of particles 
responsible for it, and it does not reproduce experimental observations on the dependence of 
the resolution on the detector layer thickness (for a detailed discussion see [497]). 

A modification of this expression has been given in [53], which takes into account the 
reduction of the total path length of all particles in the shower due to a finite cut-off for the 
minimum kinetic energy Egy of an electron (positron) that can be detected in the calorimeter, 
and the effect of different angles of the charged secondary tracks in the detector layer, 


OE gay, AE /MeV 1 
= 3.2%. -= 
E F (Ec, Eu) (cos 0) VE/Gev 


3.2% E/M 


where F (Ee, Ecut) is the correction factor for the total path length, and (cos @) the average 
of the cosines of the angle of the charged secondary tracks. Expressions for both factors are 


t 
F (Ec, Ecu) (cos 0) |r (11.5) 
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This section deals with electromagnetic 
sampling calorimeters. For the use of 
sampling in hadronic calorimeters see 
section 11.8. 


Typically, the absorber layers will have 
a high Z, whereas the detector layers 
contain lower Z elements. 


Although the obvious choice is to ar- 
range these in layers, other geometries 
also work, as long as size of the sam- 
pling is smaller than the shower size. 


The energy response of a sampling 
calorimeter is given by the measured 
signal in the active layers, and not by 
the energy deposited over a sampling 
layer. The two are generally not equiva- 
lent. 


See section 2.1. 
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e The energy resolution of electromagnetic sampling calorimeters is 
usually dominated by a stochastic term og /E = A/ VE. The propor- 
tionality constant A is larger than for homogeneous electromagnetic 


calorimeters, typically between 5%/\/E/GeV and 20%/,/E/GeV . 


e Fora larger sampling fraction, if the detector thickness is kept con- 
stant and the same absorber thickness is maintained to achieve the 
same level of containment, the energy resolution improves, as this 
increases the number of interfaces between absorber and detector. 


e For the same reason, a higher spatial sampling frequency improves 
the resolution. 


e It also improves for absorbers with a higher Z value, as this in- 
creases the number of particles in the shower. 


A semi-empirical parameterisation that aims to describe the reduction 
of the counting statistics due to the geometrical arguments has been pro- 
posed by Livan et al. [341] 


OF ddet/mm 1 
= 2.7% , (11.6) 
E ° Scamp V/ E/G 


where daet is the thickness of the detector layer (fibre diameter for scintil- 
lating fibre calorimeters) and fsamp the sampling fraction. The numerical 
front factor has been derived from a fit to the performance published for 
several electromagnetic sampling calorimeters. For these data the result of 
this parameterisation is within 10% of the actual value. 
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Eq. (11.6) can be used for sampling fractions between 1 and 10%, and 
a sampling layer thickness between 0.1 and 1 Xo. For thicker detector lay- 
ers the assumption that the dominant contribution to the signal comes from 
secondaries produced in the absorber is not valid any more and the statis- 
tics of particles generated in the detector layer itself becomes relevant. For 
very thin detector layers, and this applies to calorimeters read out with 


given in [53]. The last equality uses another result from ‘approximation B’, AE/E = tE; /E, 
where ż is the thickness of a combined absorber/detector layer in radiation lengths. However, 
the modifications do not fully address the flaws in the original expression. Nevertheless, this 
equation is widely quoted, with various omissions and approximations. 
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For sampling calorimeters the stochas- 
tic term is often called the ‘sampling 
term’. 


The sampling fraction is the ratio of 
the energy loss in the detector layer 
and the energy loss in the combined 
absorber/detector layer. 


A higher spatial sampling frequency can 
be achieved if the sampling is divided 
into finer units, while maintaining the 
same total material for absorber and 
detector, respectively. 


It might seem counterintuitive that 
the resolution improves with a thinner 
detector layer, but the key constraint 
here is that the sampling fraction is 
maintained, so that this only increases 
the spatial frequency of sampling layers. 
For practical purposes this parameter- 
isation uses the sampling fraction for a 
MIP, dg \ det 
eet ( d ) mip 


dei abs ` 
daei (E) pip + abs (E) mip 


Despite the correlation with data 

eq. (11.6) also appears incomplete, 

as it does not contain a dependence 

of the resolution on Z, and the inter- 
changability of fibre diameter and plate 
thickness appears fortuitous. 


Figure 11.5: Stochastic term of the 
energy resolution for different sampling 
calorimeters as a function of the ratio 
daet / feamp [342]. The energy in the 
figure is given in GeV. The line shows 
the parameterisation given in eq. (11.6). 
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typical gaseous or semiconductor detectors, the energy resolution does de- 


grade less strongly as a function of the sampling fraction than predicted by 


this equation [497]. 


EXAMPLES OF ELECTROMAGNETIC SAMPLING CALORIMETERS 


In this section we will discuss three different sampling calorimeter exam- 
ples covering a wide range of sampling fractions and different detection 
mechanisms. 

An example of a sampling calorimeter with a very large sampling 
fraction and scintillating fibre readout is the KLOE calorimeter [20]. 
There the requirement was for good energy resolution for photons from 
Ki s > n?n? decays. The energies of photons generated in these decays 
are low (20 < E < 510 MeV), so that one requirement was to minimise 
material in front of the active volume. Full solid angle coverage was re- 
quired, calling for a compact calorimeter. This motivated the choice of a 
sampling calorimeter with lead absorber and scintillating fibre readout. 
The fibres were 1 mm diameter with a spacing of 1.35 mm, resulting in 
a very high sampling fraction (lead/fibre/epoxy 42/48/10 by volume). 

The energy resolution therefore was excellent, with a stochastic term of 
5.7% /\/E/GeV and a negligible constant term. This calorimeter also 
achieved a good time resolution (54 ps/,/E/GeV ). 

The ATLAS electromagnetic calorimeter consists of lead absorber 
plates in a a zig-zag (‘accordion’) shape with gaps filled with Liquid Ar- 
gon (LAr) as the active medium read out in an ionisation chamber geome- 
try [109]. 

Liquid argon was an attractive option for the active layer of an elec- 
tromagnetic calorimeter at LHC because it is inherently radiation-tolerant. 
A challenge associated with this approach is the long drift time. In AT- 
LAS the width of the gap for the LAr is typically 2 mm, which results in 
a maximum drift time of 0.45 us [108], which is much longer than the 
time between bunch crossings (25 ns). Therefore, readout with a slow am- 
plifier would result in too much noise from ‘pile-up’ background (energy 
deposited by particles in other bunch crossings). This is addressed by the 
use of a fast pre-amplifier followed by an RC — CR? filter to provide a bi- 
polar shaper (the RC acts as a differentiator and the CR? as an integrator). 
The electronics noise decreases with the rise time but the pile-up noise 
increases with it (this is the same effect as discussed for semiconductor 
detectors in chapter 9). The rise time selected is longer than the time be- 
tween bunch crossings as further digital filtering is performed by the re- 
ceiver electronics. The overall filtering allows the assignment of an energy 
deposition to a particular bunch crossing. The bi-polar pulse shaping im- 
plies that there is no net DC effect on the baseline current, independent of 
the level of pile-up background. A drawback of this shaping is that it re- 
duces the size of the integrated charge signal and therefore requires very 
low noise electronics. 

In a ‘classical’ lead/liquid argon calorimeter, in which the absorbers 
are plates perpendicular to the trajectories of the incoming particles, 
the signals have to be read out by long cables. This results in a large ca- 
pacitance which would increase the noise (see chapter 3), and a large 
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Figure 11.6: Energy resolution of 

the KLOE electromagnetic sampling 

calorimeter as a function of the en- 

ergy [20]. 


Figure 11.7: Current and integrated 
charge from electron drift in the ATLAS 
EM calorimeter (top). The effect of 
pulse shaping with a bipolar shaper is 
shown for a -function pulse (middle) 
and for the triangular current pulse 
expected from a uniform ionisation 
charge in the gap (bottom) [108]. 
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inductance that would slow the response. These two issues were resolved 
by the use of an ‘accordion’ geometry. In this geometry the signals are 
read out by copper/polyimide flexible tapes which have much lower ca- 
pacitance and inductance than cables. The pre-amplifiers for the electro- 
magnetic calorimeters are placed outside the cryostat behind the active 
volume. 

Another benefit of the absorber geometry is that it achieves a projec- 
tive readout (organised in towers, which are oriented in the direction of 
the incoming particle) while minimising non-uniformities across the width 
of the readout cell. This geometry allows for a calorimeter without cracks 
between neighbouring cells. 

The stochastic term of the energy resolution measured with prototypes 
in beam tests was 10.1% /./E/GeV and the linearity was within +0.1% 
in the energy range 10 GeV < E < 180 GeV [31]. The constant term was 
(0.17 £0.04) %, but the constant term determined from an in-situ analysis 
in the experiment was larger (see Figure 11.11). 

Sampling calorimeters with a very small and adjustable sampling 
frequency have been studied by the SICAPO collaboration using sand- 
wich calorimeters with silicon detector readout. For 7 mm thick (2 Xo) 
thick tungsten absorber plates the visible energy was measured to be 
Eyis/M = (5.558 +0.004)E/GeV + (1.31.5) x 107° for an incoming 
electron with an energy in the range of 5 to 50 GeV [112]. For a calorime- 
ter made up of 24 Xo (12 cm total length), which was sufficient to ensure 
good longitudinal containment, a stochastic term of 24.9% /4/E /GeV was 
observed. The collaboration did also study other absorber materials and 
the dependency on the thickness of the absorber T (where T is defined as 
the sampling fraction measured in units of Xo for the absorber) and found 
scaling of the resolution with yT. 


CALIBRATION OF ELECTROMAGNETIC 
CALORIMETERS 


The signals from calorimeters are processed by suitable electronic circuits 
to provide amplification and filtering and finally digitised by ADCs. A 
calibration procedure is therefore required to convert back the raw ADC 
data to energy. 


11.7 
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Figure 11.8: The ATLAS accordion 
electromagnetic calorimeter. Left: Ge- 
ometry [86]. Right: Energy resolution 
as a function of energy measured using 
a prototype in a test beam [31]. The 
curve is a fit to the expected scaling with 
energy, Og /E=A/VE+B. 


If the pre-amplifiers are in the cold 
volume the noise would be slightly 
lower, but this results in more radiation 
damage and makes maintenance more 
challenging. In the ATLAS hadronic 
endcap calorimeter (HEC) calorimeter 
a different design choice has been made, 
due to the larger cell size that results in 
a larger capacitance and hence more 
noise. The design goal of being able to 
see MIP signals from muons required 
the use of cold pre-amplifiers for the 
HEC. 


The real figure of merit for a calorime- 
ter should be the full system perfor- 
mance, not the resolution achieved in 
a relatively small prototype in a test 
beam. 


The small error on the slope and the 
small constant term indicate good 
linearity of the response. 


This energy resolution is comparatively 
poor due to the small energy fraction 
sampled in the silicon. However, the 
strength of the silicon readout is the 
very fine achievable segmentation, 
which can be used for particle flow 
analysis (see section 11.13). 
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The first step in this procedure is to measure the ADC counts with no 
signal, and these values are subtracted from the raw ADC values. A very 
simplified calibration scheme is then given by 


E; = Fe\(ADC;) x F (Ij), (11.7) 


where F; is the reconstructed energy in cell i with an ADC value of ADC;, 
F.(ADC;) converts the ADC value to a measured current, and F (/;) con- 
verts the current to energy. The function Fa (ADC;) is determined using 
electronic calibration circuits in which known signal currents are injected 
into the amplifier and the measured currents are recorded. The conversion 
from measured current to reconstructed energy F (I;) requires a test beam 
study in which the response is measured for electrons of known energy. 

In this very simplified scheme the energy of an electron would simply 
be given by summing the value of £; for the cells assigned to the electron. 
If the electromagnetic calorimeter is divided into longitudinal samples, the 
performance can be improved by modifying eq. (11.7). The current for an 
electron cluster is first calculated in each layer. The reconstructed energy 
is given by 

E= > WlayerElayer- 


layer 


(11.8) 


The weights wWayer are determined empirically using data from a test beam 


by minimising 
2 
(4 E Ebeam ) 
i OF; 


=), 


where the sum runs over the events in the sample, E; is the estimated elec- 
tron energy for each event using eq. (11.8), Epeam the beam energy, and 
Or, is the estimated uncertainty in the reconstructed energy. 

In-situ calibration of electromagnetic calorimeters in collider experi- 
ments can be performed using the large sample of Z — ee~ events. We 
will consider the calibration of the ATLAS electromagnetic calorimeter as 
an example [87]. 

The scale of the measured electron energy is affected by upstream ma- 
terial in the Inner Detector (ID) and services. The interactions in the up- 
stream material will result in energy being absorbed before the calorime- 
ter and, more importantly, some of the lower energy electrons created in 
the upstream material will be deflected by the magnetic field so that the 
energy is no longer contained in the electron cluster reconstructed in the 
electromagnetic calorimeter. The material in the active volume of the ID 
can be studied using data from, for example, y conversions. 


(11.9) 
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Figure 11.9: Energy resolution as 
a function of yT for a lead/silicon 
calorimeter for a fixed energy of 
E = 4 GeV [157]. 


These are called ‘pedestals’. 


Here we have implicitly assumed the 
use of ionisation chambers to measure 
currents. A similar scheme for an active 
region that was measuring charge 
would replace current with charge. This 
will apply in particular for calorimeters 
using scintillators, which ultimately 
record charge in the photodetectors. 
For an example of a more detailed 
scheme, see [87]. 


For a large detector, it is impractical 

to use this procedure for all calorime- 
ter cells. The energy scale needs to be 
transferred from the cells calibrated in a 
test beam to the rest of the calorimeter. 
This can be achieved by various tech- 
niques, including radioactive sources 
and laser pulses. 


In a test beam the energy can usually 
be measured for reference by an up- 
stream magnetic spectrometer with high 
accuracy. 


We will first assume that the energy 
scale is the same for e~ and e*. 


The discussion given here is a very 
simplified version of the full procedure 
described in [87]. 


The probability of a gamma conversion 
depends on the material thickness in 
units of radiation lengths. Therefore 

a map of the reconstructed gamma 
conversion vertices acts like an ‘x-ray’ 
of the detector material. 
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The next step is to use the energy sharing between the first two longi- 
tudinal samplings to refine these estimates. After the upstream material in 
the simulation is tuned to agree with this data, a data-driven estimate of 
the electron energy scale is performed using Z — ete” events. The correc- 
tion factors œ; in a given slice of pseudorapidity are defined by 


poe — EMC(1 +a), (11.10) 


where E% is the measured energy and EMC 
from the Monte Carlo simulation. 

The correction factors œ; in bins i of 7 are determined from Z — e™ e7 
events, using 


is the corresponding energy 


miata — (1+ a%;)mN°, (11.11) 


ij 

Qij © (Qi +a) /2, (11.12) 
where miata (mM) is the Z mass peak in data (Monte Carlo) for which the 
electrons are in the pseudorapidity bins i and j. The values of the œ; are 
then determined by a numerical y? minimisation [87]. 
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To first order the energy scale for photons should be the same as for 
electrons but the sensitivity to upstream material is different. After cor- 
recting for these effects the energy scale for photons is validated using 
samples of Z — ete y and Z > u* u` y [87]. 

A similar methodology is used for the in-situ measurement of the 
electron energy resolution [88]. This uses a comparison of the measured 
widths of the distributions of reconstructed masses from a sample of 
Z — e*e” in real and simulated (Monte Carlo) data. The correction fac- 
tor c; in a given slice of pseudorapidity is defined by 


o (22) oa 


EMC 


(11.13) 


where E% is the measured energy and is the corresponding energy 
from the Monte Carlo simulation. The c; values are also determined from 
Z — e*e~ events, using the difference in quadrature between the mass 
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The energy sharing is also sensitive 

to the services between the ID and the 
LAr calorimeter, which are less well 
known than other types of material 
(for example the magnet). The effect of 
more material in this region will be to 
start the showers earlier and therefore 
increase the relative energy deposited 
in the first longitudinal sampling com- 
pared to the second. 


This procedure generates correction 
factors to be applied to the simulated 
data. These correction factors are 
evaluated in relatively coarse regions 
of phase space (typically n and pr). 
Therefore the Monte Carlo simulations 
are essential because they allow for 
variations in the detector response over 
smaller regions of phase space. 


Histograms of real data and simulated 
data are created in the (i, j) bins of n. 
For each bin a x7 value is computed 
comparing the measured and simulated 
samples. The overall x? is defined as 
the sum of the x? for each bin. This x7 
is minimised by varying the Œ; values. 


Figure 11.10: Reconstructed invariant 
e*e~ mass in the ATLAS electromag- 
netic calorimeter for different values 
for the scale factor œ for one electron 
with 1.63 < N < 1.74 and the other 
electron with 2.3 < n < 2.4 bins [87]. 
The variation of & in the plot illustrates 
the sensitivity of the reconstructed mass 
peak to this parameter. 


Additional validation of the electron 
energy scale is obtained from samples of 
J/Pa ete. 
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resolution measured in data and Monte Carlo 


data MC 
m m 
cij © (ci + cj) /2, (11.15) 


where ofp" (oN) is the Z resolution in data (Monte Carlo) for the pseu- 
dorapidity bin (i, j). The values of c; are then determined by a numerical 
x? minimisation using the measured and simulated mee distributions [88]. 
In practice simultaneous fits are performed to obtain @;; and c;; values. 
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11.8 HADRONIC CALORIMETERS 


In the previous sections of this chapter we have seen that electromag- 
netic calorimeters can make precise measurements of the energy and di- 
rections of e™ and photons by detecting the cascade of secondary particles 
produced by electromagnetic interactions. Hadronic calorimetry works 
on a similar principle that hadrons produce secondary particles in inter- 
actions with nuclei, which in turn can undergo further interactions. The 
energy of the incident hadron can be estimated by detecting the products 
of this avalanche. However, there are several factors that make hadronic 
calorimetry more challenging than electromagnetic calorimetry and typi- 
cally result in poorer resolution: 


e The interaction length for nuclear interactions in dense media is 
much longer than the radiation length (see section 2.12). Therefore 
hadronic calorimeters will need to be much larger or denser in order 
to achieve sufficient shower containment. This factor alone makes 
construction of a homogeneous hadronic calorimeter impractical, 
and all hadronic calorimeters are sampling calorimeters. 


e zs, which are copiously produced in hadronic interactions, decay 
into photons, which instigate electromagnetic cascades within the 
hadronic shower. The underlying physics of the electromagnetic 
cascade and the way particles in the electromagnetic shower deposit 
the energy is different than for hadronic cascades. In general this 
leads to a different calorimeter response for the electromagnetic 
shower component compared to the hadronic cascade. 
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In order to make precise and unbiased 
physics measurements we need the res- 
olution in the Monte Carlo simulation 
to be the same as in data. Therefore the 
values of ci are used to smear the Monte 
Carlo values. This is done by generating 
random numbers (g;) from a Gaussian 
distribution with u = 0 and o = 1 for 
each electron and adding an energy 

AE = giciE. 


Figure 11.11: Fitted values of the 
energy resolution correction term ci 

as a function of pseudorapidity (top 
panel) in the ATLAS electromagnetic 
calorimeter. The statistical and system- 
atic uncertainties are shown separately 
in the bottom panel [88]. The resolution 
is worse and the uncertainty is larger in 
the transition region between the barrel 
and endcap calorimeters because of the 
material from the services exiting the 
detector. 


For a more detailed, yet still educa- 
tional discussion of hadronic calorime- 
ters see [497]. 
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e The most significant issue is that a large fraction of the incident en- 
ergy in a collision of a hadron with a nucleus goes into effectively 
invisible modes (see section 2.12). The fluctuations of this energy 
fraction severely limits the achievable resolution. 


incoming hadron Figure 11.12: Components of a 
hadronic cascade. 
hadronic secondaries nuclear excitations 
T, K*, K,, Ky p, n, etc. n? spallation 
y . : 
y low-E nucleons, light nuclei 
hadronic shower Vv 
component em shower delayed photons 
component 
detected signal 1 #1 


relative to em 


11.9 HADRONIC SHOWER DEVELOPMENT 


The secondary particles from hadrons interacting with nuclei will contain 
n* and 2°. The 2* will induce further hadronic interactions. The 2° will 
decay promptly, 2° — yy, with the photons initiating electromagnetic 


cascades. We will see that this mixture of purely 
š PON, $ ae hadronic and electromagnetic shower 
A high energy hadron will induce many ‘generations’ of hadronic in- components will in general degrade the 
teractions, each producing the same average fraction of 2° and 2*, until resolution of a hadronic calorimeter. 


the energy decreases below the threshold for 7° production, and the en- 
ergy of the hadrons will be deposited in the medium as ionisation. In a first 
approximation, we assume that on average 1/3 of the secondary hadrons 
are 2° and 2/3 7. 
In this picture, after one generation of hadronic interactions, the elec- 
tromagnetic fraction of the energy deposited is 1/3. Assuming this process 
is repeated, then after the second generation the electromagnetic energy 
fraction is 1/3 + 2/3 x 1/3. This process then clearly results in an in- 
crease in the electromagnetic fraction as the number of generations in- 
crease. Therefore in this naïve model we expect the hadronic energy frac- 
tion to vary with the number of generations of hadronic interactions n as 


This assumes that all the secondary 
particles will be x* and n°. 


fn = (2/3)" [274]. The number of generations n increases with energy, a es a 
$ R A 2 : i em = l — fn increases with increasing 
therefore the hadronic energy fraction in a shower will decrease with in- energy. 


creasing energy. 

This argument must break down when the energy of the hadrons are 
below threshold for pion production. A more detailed analysis predicts a 
relationship of the form 


fa = 1- fem = (E/Eo)*"', (11.16) 


where Ep is a constant related to the energy threshold for pion production T™e value of Eo depends on which 
‘ 2 : nucleus is used, but is of the order of 

and k is aconstant with a value around 0.85 [255]. This more accurate 1 GeV. 

calculation still shows that the hadronic energy fraction in a shower will 

decrease with increasing energy. In the high energy limit the shower be- 


comes purely electromagnetic. 
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11.10 HADRONIC SHOWER DIMENSIONS 
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Figure 11.13: Particle spectra produced 
in lead in electromagnetic cascades 
initiated by 100 GeV electrons (left), 
and in hadronic cascades initiated by 
100 GeV protons (right), calculated from 
a FLUKA simulation. The y-axis shows 
the energy times the flux for a given 
energy Ed®(E)/dE (which equals the 
flux per unit lethargy, ®(u). Lethargy is 
defined as decrease in neutron energy, 
du = dE/E), to preserve the visual 
feeling on how many particles each 

bin counts for, given that the E axis is 
logarithmic. The integral of each curve 
gives the relative fluence for a given 
particle (Data courtesy of A. Ferrari). 


As for electromagnetic calorimeters, the depth required to give good con- 
tainment of the showers grows logarithmically with energy. For particles 
in the hundreds of GeV range a total absorber thickness of at least ~10 Aint 


is needed. 


The longitudinal shower profiles for pions of different energies are 
shown in Figure 11.14. The profiles for proton-induced showers are 


slightly different because of the larger pN inelastic cross-section. Shower 


profiles in different material show approximate scaling in Aint. The de- 


velopment of the shower after the maximum is dominated by the neutron 


component [237]. 


°210 GeV 


= 135 GeV 
+ 40 GeV 
po a 30GeV 
T * 20 GeV 
2 F © 10 GeV 
Sy o 5GeV 
at 
>S poss 
86 “ie 
BS i 
vo nee 
al. ~ 


5 6 7 
Calorimeter Depth [Amr] 


The widths of hadronic showers are also much larger 


T 


8 9 10 11 12 13 


than for elec- 


tromagnetic showers. As for electromagnetic showers, the width grows 


As it is common that the hadron 
calorimeter is used as a muon filter 

(see section 12.8), an additional advan- 
tage of making the hadronic calorimeter 
thicker is that it will reduce the back- 
ground of pions generating hits in the 
muon chambers behind the hadron 
calorimeter (‘punch-through’). 


Figure 11.14: Hadronic shower profiles 
for different pion energies for the 
WA78 calorimeter (U/scintillator up 

to 5.4 Aim, Fe/scintillator after that) 
[186]. 


with shower depth. We can understand the approximate lateral size at the 
end of the shower by considering the threshold for pion production. The 


threshold energy for pion production 77 


Ep > Tt 


Etn is 312 MeV (see 


exercise 4). As discussed in section 2.12, the transverse momentum in 
hadronic interactions is ~ 400 MeV/c. The hadrons produced by high en- 
ergy incident hadrons at the start of the shower will tend to be emitted at 
small angles to the incident hadron (see section 2.12). The pions (or other 
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secondary hadrons) produced at the end of the shower will tend to have 

momenta comparable to the threshold for pion production. Therefore these 

hadrons will have similar values for the longitudinal and transverse mo- 

mentum and therefore be produced with an approximately isotropic angu- 

lar distribution. As these pions will travel for a distance of about one Aint, 

the width of the hadronic shower will grow by about this amount. This is 4 useful rule of thumb [330] is that 
aie . cy, i i 95% of the hadronic shower will be 

guiding the choice of transverse granularity in a hadronic calorimeter as contained within a radius of 1 int- 


there will be no gain in having a granularity very much finer than one Aint. 


11.11 ENERGY MEASUREMENT IN HADRONIC 
CALORIMETERS 


As discussed, hadronic interactions inevitably result in ‘invisible’ energy. 
In addition, the low energy protons and light nuclei from spallation will be 
very heavily ionising and therefore they will have a very short range in a 
dense absorber. Hence, most of their energy will be deposited in the pas- 
sive rather than the active layers of a sandwich calorimeter. Therefore, in 
general we expect that the calorimeter response to electrons will be greater 
than that of hadrons at the same energy. 

In order to understand the implications for the calorimeter energy reso- 
lution it is convenient to define the ratio of response of electrons to pions, 
and of electrons to hadrons that do not generate any photons or electrons 
in their interactions (i.e. they produce 2+ but not 2°). We define the corre- 
sponding ratios symbolically as 


E (e) E (e) As will be discussed below, these ratios 
e/h = —— and efn = Dy? ad 1. 17) are a property of the calorimeter, given 
E (h) E (7 ) by its geometry, materials and readout. 

Y US g ) 


where E(e) is the detected signal (energy) from an electron, and 7% and h 
indicate the same property for a real 2~, and for a hadron creating only a 
non-electromagnetic (purely hadronic) response, all for the same incoming 
particle energy. 

Most of the visible energy in a high energy hadronic shower will re- 
sult from low energy hadrons towards the end of the cascade where there 
are more particles. Let fem be the fraction of electromagnetic energy re- 
leased in an interaction of a 7* with a nucleus. We can then write down an 
expression involving real z~ as 


E(e) 
eJn = y (11.18) 
T= FE) + (1 — fim EC) 
which we can rewrite in a more useful form as 
I 
ejn = an (11.19) 


1— fem(1 —e/h) 


The first issue for high-resolution hadronic calorimetry is that there 
are large event-by-event fluctuations in fem and therefore, if the value of Jem can be measured for each event in 

Se ee . . : . : . dual readout calorimeters, as discussed 
e/his significantly different from 1, this will result in fluctuations ine/7. below. 
This in general degrades the resolution of the energy measurement of a 
hadron and the arguments which led to the resolution of an electromag- This is often accommodated by adding 
netic calorimeter varying as og /E =A/E will no longer be valid and the 4 (large) constant term to the A/ VE 


: À : Á term in the parameterisation of the 
energy resolution will generally improve more slowly with energy. resolution. 
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The second issue is that fem increases with energy, as shown in 
eq. (11.16). This will mean that the response is non-linear if the value of 
e/his significantly different from 1. If we were only interested in mea- 
suring single hadrons, then the non-linearity could in principle be cor- 
rected by a calibration factor. However, we usually need to measure a ‘jet 
of hadrons containing a mixture of hadrons with unknown energy, so the 
non-linearity cannot be simply corrected. 

However, if we can ensure that our calorimeter satisfies e/h = 1 then 
from eq. (11.19) e/a = 1 and therefore we are completely insensitive to 
the value and fluctuations in fem. In sampling calorimeters we can tailor 
the response to the different shower components, which allows to achieve 
this goal. This goes by the general name of ‘compensation’, and we will 
review different approaches in the following. 


’ 


LOWERING RESPONSE TO ELECTRONS 


One approach to the problem is to lower the average response to electrons 
to achieve the desired condition e/h = 1. To consider this approach quanti- 
tatively it is useful to define in a similar fashion as before symbolically the 
ratio of the response of electrons and MIPs, e / MIP. 

As discussed in section 11.3, it is the low-energy interactions towards 
the end of the electromagnetic shower development that have a significant 
effect on the signal recorded in the calorimeter, and the energy recorded 
will be different for the neutral and the charged component in the shower. 
For low energy electrons generated in the passive layers of a sampling 
calorimeter the stopping power due to ionisation/excitation energy loss 
is high and they will have a very short range. However, electrons gener- 
ated near the edge of the passive absorber can reach the active layers and 
make a large contribution to the ‘visible’ energy deposited. For photons 
the cross-section for Compton scattering scales with Z (the atomic num- 
ber) and the cross-section for the photoelectric effect scales more rapidly 
with Z. Therefore, if the atomic number of the passive layers (Zpassive) iS 
equal to that of the active layers (Zactive) we expect e/MIP = 1. However, 
in practice we need Zpassive > Zactive to make a compact calorimeter, and in 
this case e/MIP < 1. 

These insights provide a possible way to further reduce the value of 
e/MIP: if we add a thin lower-Z absorber behind the higher-Z absorber, 
low energy electrons produced in the high-Z absorber cannot reach the 
active layer, as they are stopped in the low-Z absorber, but in the low-Z 
absorber fewer electrons are produced due to the lower interaction cross- 
section for photons. The net result will be to lower the value of e/MIP. 
This effect was studied systematically by the SICAPO collaboration who 
used silicon as the active layers, high-Z absorbers (uranium or tungsten) 
and added thin layers of low-Z material (G10) downstream of the high-Z 
absorbers [330]. 


RAISING THE RESPONSE TO HADRONS 


The fraction of energy lost to nuclear breakup and the fraction of energy 
given to the neutrons are both approximately constant with energy. There- 
fore, if we can efficiently detect the neutrons in the active layers, there is 
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For homogeneous calorimeters e/h 
would always be >1. Hence, homoge- 
neous calorimeters, even if they would 
be dimensionally feasible, would always 
be uncompensated. 


To a good approximation we can use 
muons to measure the calorimeter 
response to MIPs. 


The scaling for photoelectric effect is in 
the range Z? and Z. 


In typical calorimeters e/MIP ~ 0.6. 


G10 is a high-pressure fiberglass lam- 
inate that is a base material for the 
fabrication of printed circuit boards. 
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the possibility to compensate for the lost energy and therefore achieve the 
compensation condition, e/h = 1. 

Most of the neutron energy comes from low-energy evaporation neu- 
trons with energies ~ 1 MeV. We can transfer some of this neutron energy 
to nuclei by elastic scattering. The average energy transferred to the struck 
nucleus with mass number A, by a neutron of energy En is 


(AE) =“ 


ae (11.20) 


Therefore the most efficient case corresponds to A = 1, i.e. hydrogen. In 
practice the most convenient scheme to introduce hydrogen is to use plas- 
tic scintillator for the active layer. 

The path length of low energy neutrons can be G(10 mm) in dense 
media. Therefore these neutrons can cross several passive and active lay- 
ers. From eq. (11.20) we can see that the average energy loss for neutrons 
in collisions in the high-Z (and therefore high-A) absorber will be very 
small, whereas collisions with hydrogen in the plastic scintillator will re- 
sult in 50% of the energy being transferred on average. The resulting low 
energy protons and struck nucleus will have a very short range and there- 
fore deposit all their energy locally. The signal resulting from the neutron 
elastic scatters in the plastic scintillator will be detected efficiently even 
if we have thin active layers, but the response to electrons will be smaller 
because of the reduction of overall active material. Therefore we can in- 
crease the hadron response and hence decrease e/h by increasing the ratio 
of thicknesses of the passive absorber (tp) to active layer (ta). 

A large compensating calorimeter was built by the ZEUS collaboration 
for the HERA ep collider, using uranium wrapped in stainless steel for the 
passive layers and scintillator for the active layers [58, 511]. Compensa- 
tion was achieved by a combination of decreasing the electron response 
by the use of a combination of high-Z and low-Z absorber (3.2 mm thick 
uranium plates wrapped in 0.5 mm thick stainless steel) as discussed in 
the previous section, and by having a large value for tp/ta to increase the 
hadron response. 

In order to achieve e/h ~ 1 a ratio tp/tg ~ 4 was required. The thick- 
ness of the scintillator plates could not be less than 2.5 mm, therefore very 
thick absorber plates had to be used. This had the drawback of increasing 
the sampling fluctuations for electromagnetic showers. 

Very good hadronic energy resolution was achieved in a testbeam ex- 
periment [511], but the electromagnetic energy resolution was relatively 
poor compared to other calorimeters 


Og /E = 35%/\/E/GeV 2% (hadronic), 


(11.21) 
oE /E = 18%/\/E/G&V © 1% (electromagnetic). 


To be able to achieve the optimal ratio of passive to active absorber 
without compromising the resolution for electrons and photons, a different 
geometry has been proposed: the ‘spaghetti’ calorimeter or SPACAL [15]. 
In this approach lead was used as the passive absorber and 1 mm diameter 
scintillating fibres were placed in holes running through the calorimeter in 
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See exercise 5. 


Organic scintillators use plastic base 
material such as polystyrene, (C3Hs)n, 
which have a high hydrogen content. 


The increase in the signal might be 
limited by saturation effects. See for 
example for scintillators the discussion 
in section 5.1. 


Consequently, the energy measurement 
of hadrons (and jets) will improve 

for smaller sampling fractions, an 
observation that might appear counter- 
intuitive. However, this reduction helps 
as it boosts the hadronic relative to the 
electromagnetic response. The energy 
resolution for electromagnetic showers 
on the other hand degrades for smaller 
sampling fraction, as discussed in 
section 11.6. 


The (depleted) uranium was radioactive 
and the stainless steel was also required 
for radio-protection reasons. 


Ina scintillator, light is captured by to- 
tal internal reflection and transferred to 
the edge of the plate. In practice, there 
will be surface imperfections and there 
will be less than total internal reflec- 
tions. The number of reflections increase 
as the plate thickness decreases. There- 
fore too thin scintillator plates result in 
larger absorption for the scintillation 
light. 


Figure 11.15: Cross-section of a lead 
absorber plate for the hadronic section 
of the HI SPACAL [63]. In this detector 
ty/ta = 3.4: 1, and the fibre diameter 
was I mm. 

For a discussion of scintillating fibres 
see section 5.2. 
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the direction of the incident particles. The fibres were grouped into bun- 
dles and connected to photomultiplier tubes for the readout. A very good 
resolution for hadrons was obtained, 


27.7 
OF P D25. 
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This type of calorimeter was proposed for use at the LHC but this 
technology was not adopted because it could not compete with alternatives 
for the electromagnetic energy resolution. In addition, it did not allow for 
projective geometry which is very important for electrons, photons, ts and 
QCD jets. 

A summary of the resolution achieved for hadronic calorimeters us- 
ing uranium/scintillators [495] is shown in Figure 11.16. As expected, we 
obtain the optimal intrinsic resolution for a hadronic calorimeter if the 
compensation condition (e/h = 1) is satisfied. 


(11.22) 
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If a hadronic calorimeter has sufficient granularity it is possible to improve 


the resolution of a non-compensating calorimeter in software by trying 
to apply different weights to the electromagnetic and hadronic shower 
components. A first demonstration of this technique was made using a 
simple non-compensating iron/scintillator sandwich calorimeter for the 


CDHS neutrino experiment [13]. The idea behind the very simple software 


compensation algorithm was to de-weight large local energy depositions 
as they were probably due to electromagnetic energy. 
The weighted cell energies were calculated from the unweighted ener- 
gies E; using 
Ej = (1—CE;) Ei, 


where C is an empirical constant that optimises the resolution. A signifi- 
cant improvement in resolution is achieved by this simple algorithm. Note 


(11.23) 
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It was also significantly more expensive 
than alternative technologies. 


The lack of projective geometry in the 
design proposed at the time would 

also have been incompatible with the 
particle flow analysis discussed later in 
this chapter. 

The original motivation for the use of 
depleted uranium absorbers has been 
the expectation that neutron-induced 
fission would boost the hadronic signal 
(see for example [238]), and thus lead 
to a compensated calorimeter. While 
this indeed contributes, today the bene- 
fits of this absorber are more seen in its 
large atomic number Z. 


Figure 11.16: Intrinsic resolution and 
e/h ratio of different uranium/plastic 
scintillator hadronic calorimeters as a 
function of sampling fraction [495]. 
The intrinsic resolution is defined as 
Ofitrinsic = OP caaied T Onasciling where 
Omeasured İS the directly experimentally 
measured resolution and Osampling İS the 
expected contribution to the resolution 
from sampling fluctuations (estimated 
using eq. (11.1)). 


For this type of calorimeter we expect 
e/h > 1 for the reasons discussed 
earlier. 
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also that the resolution after weighting scales as 1 / VE , unlike before the 
weighting. This compensation relied on very fine longitudinal sampling 
which would not be practical for a hadron calorimeter for a collider detec- 
tor. 
Much more sophisticated software compensation algorithms have 
been developed by the LHC experiments which have more (less) granu- 
lar calorimeters in the transverse (longitudinal) direction. As an example, 
the measured jet energy resolution for the ATLAS detector [89] is shown 
in Figure 11.18. The noise term is a major factor at low value of pi but is Te en Amour se cael 
very small at high values. The resolution at the highest momenta show a see seat a ae ne 


large constant term because the software compensation is imperfect. also some experimental systematic 
uncertainties depend on pr, not E. 


+ 0.4¢ T : T 3 Figure 11.18: Jet energy resolution 
~ ost ATLAS F Anti-k, R = 0.4 (PFlow+JES)4 measured by ATLAS as a function of the 
g TE Vs = 13 TeV, 44 fb 0.2 < Inl < 0.7 3 jet transverse momentum pe [89]. The 
© 0.36 + Dijet in situ E eorum used to aint ees 
¢ = objects into jets is described in ; 
6 0.25 ; PIE J 
2 — Noise term in situ, Nip, 3 R= \/(6n)? + (A@)?, where An (Ad) 
3 0.2 "mmm /n situ combination, J is the separation in ņ (@) between a 
E Nip, ® Sip, eC 4 eat 
2 0.15 TA VP, E| cluster and the jet axis. Monte Carlo 
5 a a O OTT MC prediction J simulations are used to correct the 
= oE q jet energy to that of the particle-level 
z 0.056 4 energy of the jet (‘JES’). ‘PFlow’ refers 
5 E Sas ; k to the use of tracker measurements 
0 20 30 102 2x10? 10° 2x10 for charged particles in the jet recon- 
p“ [GeV struction as opposed to calorimeter 
T 


measurements (see section 11.13). 


The resolution is quite good but not as good as could be achieved by a 
compensating calorimeter (compare to for example [511]). This is another 
illustration of the fact that detector designs involve compromises which 
depend on the physics priorities. For ATLAS and CMS, priority was given 
to the electromagnetic calorimeter resolution, at the cost of poorer hadron The high resolution electromagnetic 


% š : calorimeters were essential for the 
reconstruction in the overall calorimeter systems. discovery and study of the Higgs boson 


in the H + yy and H > ZZ*, Z — e+e 
decay modes. 


DUAL READOUT CALORIMETERS 


The final approach to trying to improve the hadronic resolution for non- 
compensating calorimeters is to provide, in addition to the measurement 
of the energy deposited by the hadronic shower, a measurement of the 
fraction fem of the energy deposited by the electromagnetic component 
for each event. A way to achieve this is by measuring the light signal 
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from Cherenkov radiation in the calorimeter. This is predominantly sen- 
sitive to the electromagnetic shower component because the electrons 
and positrons in this component are relativistic down to energies of about 
200 keV. Most of the non-electromagnetic energy deposition in hadron 
showers, on the other hand, is from non-relativistic protons generated in 
nuclear reactions. While these will not generate Cherenkov radiation, they 
will produce signals by ionisation, for example in liquid noble gas ion- 
isation chamber detectors, or by scintillation. By combining these mea- 
surements with the Cherenkov light signal generated in the shower, the 
electromagnetic shower fraction can be determined for each event, and 
considered in the reconstruction. 

After an independent calibration of the two signals for electromagnetic 
events (i.e. with electrons), the reconstructed energy for hadrons is given 
by 

S = E [fem + (h/e)s(1 — fem)] 
C =E [fem + (h/e)c(1 — fem) 


where the (h/e)s c are the ratios of the hadronic response over the response 
for electrons for the two types of readout. fem can be eliminated from this 
system of equations, and the hadronic energy is obtained from 


(for scintillation), 
(for Cherenkov), 


= S=xC 
a er 


with x = [1—(h/e)s]/[1 — (h/e)c]- 

This approach avoids several constraints usually imposed by compen- 
sation. The absorber material does not have to have a very high Z, cop- 
per is sufficient. There is no constraint on the sampling fraction, it can be 
made large enough to allow for a good energy resolution for electromag- 
netic showers. Finally, there is no need to detect neutrons, with the asso- 
ciated long time constants. However, it is more challenging to achieve a 
projective geometry with this approach. 

The DREAM calorimeter [327] was a prototype of a sampling 
calorimeter to demonstrate this concept, with a copper absorber and ac- 
tive elements comprising doped (for scintillation) and undoped plastic or 
quartz fibres (for detection of Cherenkov photons) that are read out by 
different PMTs. The volume ratio was Cu/doped fibre/undoped fibre/air 
69.3/9.4/12.6/8.7 and the sampling fraction for MIPs was 2.1%. The depth 
of the prototype was 10 Aint [327]. 

In this prototype y was about 0.29. The energy resolution was lim- 
ited by lateral shower leakage due to the limited size of the prototype. 
However, a clear improvement in hadronic energy resolution after com- 
bining the independent signals was observed, with a very small constant 
term [327]. Furthermore, after combination the signal scales for different 
particle types do agree, also with the calibration obtained from electrons. 

Dual calorimeter readout has also been investigated for crystal 
calorimeters, where the different time scales of the prompt Cherenkov 
and the delayed scintillation signal can be exploited. The main challenge 
here is the strong attenuation of the short-wavelength Cherenkov photons 
in the crystal, which leads to a non-linear response of the calorimeter for 
this signal [33]. 
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Figure 11.19: Cross-section of the basic 
cell of the DREAM calorimeter [32]. 

7 fibres (3 doped and 4 undoped) are 
inside a hollow square copper rod. 
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Figure 11.20: DREAM calorimeter prototype [327]. Left: Energy resolution for multi-track events (‘jets’). Right: Energy scale for multi-track events 
and single pions, compared to electrons. ‘Corrected’ in both cases means results derived with both readouts combined. 


11.12 SIMULATION OF CALORIMETERS 


As we have seen throughout this chapter, the development of electromag- 
netic and hadronic showers is governed by complex processes, and the ac- 
curacy of analytical predictions of the shower development and the quality 
of the energy measurement are limited. Therefore, such predictions require 
the use of Monte Carlo techniques to ‘simulate’ real showers. The basic 
idea is to use the Monte Carlo technique to track particles as they propa- 
gate through matter and create secondary particles, until the energies of 
the particles fall below some defined threshold at which point the particles 
are stopped and their energy deposited locally. After generating the tra- The Monte Carlo method is explored for 
A : i ` . ` a very simple example in exercise 7. 
jectories of all secondaries in the calorimeter, the total energy deposited 
in small volumes of the active layers are stored. In a separate pass the de- 
posited energy is used to calculate the response of the detector. This can 
include the response of the full electronics chain used in the readout and 
therefore can output data in an identical format to that of real data. This 
allows the same reconstruction software to run on real and simulated data, 
thus allowing for calculation of all aspects of the detector response (e.g. 
resolution or efficiency for a given type of particle). 
One difficulty with this approach is to accurately model the full detec- 
tor geometry, due to the large number of small segments required in the 
simulation. This is very challenging for complex collider detectors, where 
the geometry of the tracking detectors needs to be accurately modelled 
as well as that of the actual calorimeters. To keep the number of volume 
elements practical some simplifications are usually required. 
Apart from the general difficulty of modelling complex geometries, 
minimum energies to track produced particles have to be carefully se- 
lected. The best accuracy will require the lowest energy thresholds but 
this will come at a cost in CPU time. Therefore, any application will haye Rather than just set global thresholds, 
: $ the thresholds can be adjusted depend- 
to assess this trade-off to select the optimum thresholds. ing on the location of an interaction. 
Simulations of electromagnetic showers are in principle simpler than This can be used to improve the mod- 


: . : elling of the effects of 5 rays without too 
hadronic showers as the underlying physics processes are less complex. large a cost in CPU time. 
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The most comprehensive and widely used software for these type of 
Monte Carlo simulations is GEANT4 [29]. 

An example for the study of the accuracy of a GEANT4 simulation for 
electron-induced showers is the comparison of measured and simulated 
data for a test module of the ATLAS electromagnetic accordion calorime- 
ter [31]. This calorimeter was longitudinally segmented into a 1.7 Xo thick 
pre-shower detector followed by a front, middle and back compartment 
with thicknesses of 4.6 Xo, 17.6 Xo, and 5.0 Xo, respectively. A measure- 
ment sensitive to the longitudinal shower development is the fractional en- 
ergy deposited in the first section of the calorimeter. The transverse width 
of the shower was studied by measuring the distribution of the fractional 
energy deposited in the -strips of the first section of the calorimeter. The 
Monte Carlo results for the longitudinal and lateral shower shapes are in 
good agreement with the test beam data. 
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GEANT4 is widely used in particle 
physics, but it is had many applications 
in other domains. For example, it is 
used in nuclear medicine, where it 

is used in planning radiotherapy to 
optimise the delivered doses. 

For purely electromagnetic processes, 
the older EGS4 code can be used. A 
more modern version of the EGS4 

code is maintained by NRC [299]. A 
systematic comparison between EGS4 
and GEANT4 can be found in [384]. 
Other codes for simulating hadronic in- 
teractions such as CALOR and FLUKA 
can now be used within GEANT4. 
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Figure 11.21: Comparison of measured and simulated (GEANT4) shower shapes for the first section of the ATLAS accordion calorimeter [31]. Points 
are data and lines are the result from Monte Carlo simulations. The shaded area corresponds to the uncertainty in the amount of material before 

the calorimeter. Left: Reconstructed energy fraction in the first compartment for 10 GeV (open circles) and 100 GeV (full circles) electrons. Middle: 
Reconstructed energy fraction in the different N (pseudorapidity) slices for 100 GeV electrons. The width of the slices is 0.025. Right: Reconstructed 


electron energy for 100 GeV electrons. 


A particularly important test of the accuracy of the Monte Carlo sim- 
ulation is the distribution of measured energy. The data show a Gaussian 
peak and a ‘tail’ on the low side. In a test beam the tail is due to electron 
bremsstrahlung in upstream material resulting in lower energy electrons 
being deflected out of the beam by the magnets in the beamline, The sim- 
ulation is in agreement with the data both for the core and the tail of the 
distribution. 

It is much more difficult to achieve the same accuracy for the more 
complex geometry of a collider detector. Therefore as discussed in sec- 
tion 11.7 in-situ calibration procedures are required to achieve the optimal 
performance. 

The simulation of hadronic showers is more complex. As discussed in 
section 2.12 there is no fundamental theory for hadronic reactions in the 
appropriate phase space region. In addition we need to consider energies 
ranging from thermal neutrons to TeV. There is no solution that will be 
suitable for all tasks and many different approaches are required to cover 
this range. 

When sufficient data is available, the optimal strategy is to use this to 
create a data-driven model. An important example of this approach is the 


In a collider detector, the tracking 
detector will be G(1 Xo) thick, so there 
will be a high probability for electrons 
to emit bremsstrahlung photons before 
hitting the calorimeter. This can lead to 
lower energy electrons being deflected 
more by the magnetic field and not 
being included in the reconstructed 
electron cluster. 


Even higher energy hadrons need to be 
considered for simulating detectors for 
very high energy cosmic rays. 
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use of very extensive data for low energy neutrons. This method provides 
accurate simulation of neutron transport and isotope production. 

When there is some relevant data but the coverage is insufficient for 
a purely data-driven approach, parameterised models are used. These use 
fits to experimental data to interpolate between data points and if neces- 
sary to extrapolate. An example of this approach is the interpolation of the 
measured pion-nucleus cross-sections by the GEISHA code [249]. 
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When there is insufficient data for this approach, we rely on theory 
models. A very wide range of models are available for different energy 
regions and processes. At high energy, one approach is based on the par- 
ton ‘string’ model. At low energies, models of intra-nuclear cascades are 
required. The choice of which set of models to use will depend on the ap- 
plication and the available CPU resources. 

As an example of the performance of a GEANT4 simulation for high 
energy pions, Figure 11.23 shows the ratio of measured energy resolution 
from test beam data of the ATLAS Hadron Endcap Calorimeter (HEC) 
to that from simulations [308]. There is an approximate agreement be- 
tween the data and the simulation. As for electromagnetic showers, the 
best accuracy requires in-situ calibrations during operation as discussed in 
section 11.14. 
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The GEISHA code is included in the 
GEANT4 package. 


Figure 11.22: Pion-nucleus cross- 
sections as a function of pion momen- 
tum. The circles are the measured data 
and the curves are the model parameter- 
isations. The upper panel is for the total 
cross-section and the lower panel is for 
the inelastic cross-section [249]. 


In the Lund string model the strong 
gluon self-interactions form a colour 
flux tube. As the tube is stretched it 
creates new hadrons. 


Figure 11.23: Ratio between the simu- 
lated and measured energy resolution 
for the ATLAS HEC as a function of 
pion energy [308]. LHEP and QGSP re- 
fer to different physics models selected 
in GEANT4. GEANT3 refers to an older 
version of the code. 
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11.13 RESPONSE TO HADRONIC JETS 


Our discussion of calorimeters has focused on the response to electrons 
and single hadrons. However, in high energy physics we often need to 
measure QCD ‘jets’ rather than single hadrons. In the theory of QCD we 
can use perturbation theory to predict the cross-sections of events at the 
quark and gluon level. However, these quarks and gluons must be confined 
inside hadrons and they therefore hadronise into jets of hadrons. These 
hadrons will tend to have low values of pr about the axis of the original 
quark or gluon and will therefore appear like a narrow jet of particles in a 
detector. 

In order to reconstruct the kinematic properties of the primary parton, 
we need a jet algorithm that will combine the appropriate energy deposi- 
tions in different cells of the calorimeter into a cluster of energy belonging 
to the jet. 

The conceptually simplest approach is the cone algorithm. The highest 
pr cell in the calorimeter is used as a seed and all cells within a cone of 
some fixed radius R = \/(An)? + (A@)? about the seed cell are clustered. 
The process can be repeated for new seed cells until there are no unused 
cells above a certain threshold in pr. There are theoretical problems with 
this naive algorithm and more sophisticated algorithms are used [174]. 


JET MEASUREMENTS USING PARTICLE FLOW 


An alternative approach to improving the resolution of hadronic jets for 
detectors not using compensating calorimeters is to use a ‘particle flow’ 
algorithm [451]. This method relies on the observation that typically in a 
jet about 60% of jet energy is carried by charged hadrons, 30% by photons 
(mostly from 2° — yy), and only about 10% by neutral hadrons (mostly n, 
Ky). If a good charged particle momentum measurement is available (typi- 
cally from the tracker in a multi-detector experiment), and if the individual 
particle contributions can be resolved, then the charged hadron component 
of the jet can be measured more accurately in that way. At the same time, 
energy resolution for the measurement of photon energies will be better 
measured in the electromagnetic calorimeter. Only the contribution from 
neutral hadrons must be obtained from the calorimeters. 

The simple idea has many complications in practice; the energy de- 
posited by the charged particles in the calorimeters can overlap with the 
electromagnetic and neutral hadronic energy and therefore needs to be 
subtracted to avoid double-counting. Ultimately, the resolution of this 
method is dominated by ‘confusion’ in the matching and assignment of 
energy depositions to different shower components. 

The particle flow approach was studied for the proposed experiments 
at a future linear et e~ collider. Monte Carlo simulation studies showed a 
jet energy resolution of [473] 


Of 21 aa 
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The first term represents the intrinsic calorimeter resolution, and the fol- 
lowing terms represent imperfect tracking, leakage and confusion, respec- 
tively. 
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We need this if we want to compare 

data to QCD predictions. We also need 
this in order to use jets to reconstruct 
masses of primary objects, e.g. for the 
Higgs decay H —> bb we need to use 

the measurements of the kinematic 
properties of the b jets to reconstruct the 
mass of the Higgs. 


An is the separation between a cell and 
the jet axis in the longitudinal direction 
with an equivalent definition for Ag in 
the azimuthal direction. 


This is particularly challenging in 
the core of a jet where there is a high 
particle density. 
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At the LHC the particle flow approach is very challenging because of 
the large ‘pile-up’ background arising from interactions in the same beam 
crossing as the triggered event. The particle flow approach is less sensi- 
tive to pile-up than purely calorimetric measurements as the momenta of 
particles coming from vertices other than the ‘primary’ can be discarded. 
However, at high pile-up the vertices are not always sufficiently separated 
so there can be confusion. Therefore, at low transverse momentum the 
resolution deteriorates with increasing number of pile-up interactions. 

The particle flow approach has been used to improve the resolution 
for hadronic jets at lower energies. At very high energies, the momen- 
tum resolution of the charged track measurements deteriorates, which 
decreases the achievable gain with this approach. An example of the jet 
energy resolution that can be achieved with this approach in the CMS ex- 
periment [201] is shown in Figure 11.24. 
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11.14 CALIBRATION OF HADRONIC CALORIMETERS 


As for electromagnetic calorimeters, we require an electronic calibration 
system to convert ADC counts to charge (typically in fC). We then need 

a calibration from charge to deposited energy. Small test calorimeters can 
be used to study the response to single hadrons and determine this con- 
version constant. The calorimeter in a typical particle physics experiment 
is divided in an electromagnetic and a hadronic section. Typically a large 
fraction of the energy of a hadron will be deposited in the electromagnetic 
calorimeter before reaching the hadronic calorimeter. Therefore these test- 
beam calibrations will have to include a section of the electromagnetic 
calorimeter before the hadronic calorimeter as in the final detector. 

As for large electromagnetic calorimeters, it is impractical to calibrate 
all cells of a very large hadron calorimeter in test beams. The calibration 
from the part measured in the test beam has to be transferred to the rest of 
the calorimeter by e.g. using radioactive sources. 

The calibration schemes for LHC hadron calorimeters need to allow 
for many effects, including pile-up, the hadron spectrum within a jet of 
different flavours and the non-compensating calorimeters used. Finally, 
if a particle flow analysis (PFA) is used, then this requires an additional 
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Pile-up contributes a momentum- 
independent ‘noise’ source. Therefore 
the effect on the fractional resolution is 
larger at low pr. 


The improvement using particle flow 
was particularly important for CMS 
compared to ATLAS because of the 
higher precision tracking detector (due 
to the large magnetic field) and the 
limited performance of the hadronic 
calorimeter. 


Figure 11.24: Jet energy resolution 
determined from simulated events by 
CMS using the particle flow algo- 
rithm [201]. u is the mean number of 
pile-up interactions. 


Hadrons don’t ‘know’ that the calorime- 
ter is divided into electromagnetic and 
hadronic sections! 
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calibration step. These procedures need to be developed on Monte Carlo 


samples in which the ‘truth’ jet energy can be compared to the ‘measured’ 
response. The Monte Carlo simulation itself will 


a oa : 3 be tuned to agree with the test beam 
The uncertainties arising from the use of Monte Carlo simulations can measurements. 


be reduced by using in-situ calibrations to determine the average response 
for hadronic jets. Several different samples are used in these studies pro- 
viding different information. These include 


e Z+jet (with Z— ete~ or Z— u* u`). The Z decay products can be 
measured with much greater precision and accuracy than hadronic 
jets in the tracking systems. Therefore they can be used as a proxy 
for the ‘truth’. The response in transverse momentum pr is defined 
as R(pr) = pr(jet)/pr(Z). Then, R(pr) can be measured as a func- 
tion of pr in some range of pr and pseudorapidity 71. Similar stud- 
ies can be performed with y+jet events, which have a larger cross- 
section and the analysis can therefore be extended to higher values 
of pr before the statistical uncertainties become too large. There are more subtle second Order 


effects we have ignored, so the analysis 


ie : ‘ b ‘ i : is repeated in a Monte Carlo sample to 
e Di-jet balance. There will be insufficient Z+jet events with the jets Syrin RYE (pr) and one then i 


in the forward (high |n|) regions of the calorimeter. If the central the ratio R(pr)/R“° (pr) to determine 
r : : 3 P data driven correction factors. 

region is calibrated, then the calibration can be extended to the for- 

ward regions by using di-jet balance, with one jet in the central re- 

gion and one in the forward region. In di-jet events, the pr of the 

two jets should be equal to ensure momentum conservation. In this 

procedure the central jet serves as the proxy for the ‘truth’ jet and 

hence determines R(pr). 


e Multi-jet balance. There will also be insufficient Z+jet events with 
very high pr for the Z+jet calibration to work in this range. Multi- 
jet events have large cross-sections and we can use events in which 
one high-pr jet is recoiling against two lower-pr jets. If the lower- 
pt jets have been calibrated, then they serve as proxy for the ‘truth’ 
and so can be used to extend the determination of R(pr) to higher 
values of jet-pr. 


Using these in-situ techniques [201] in the LHC general purpose ex- 
periments the jet energy scale is known to a precision of 1%. 
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Finally, W — qq samples can be identified using t? + bbW +W- 
events with one W decaying leptonically and one decaying hadronically, 
W — qq. The distribution of the reconstructed W — qq mass using these 
samples can be compared for data and simulation. The good agreement 


between data and simulation is an important cross-check for the jet energy 


calibration. 
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In a similar approach to that used for the electromagnetic calorime- 
ter, we can also use an in-situ analysis to determine the difference in the 
jet energy resolution between data and Monte Carlo. This difference can 


then be used to modify the simulated jet energies so that the jet energy res- 


olution is compatible with the resolution observed in the real data. One 


approach is to use clean di-jet events (i.e. with no third jet) to estimate the 


resolution. The asymmetry of the two jet energies is defined as [89] 


b 
ga a 
(oro) 


where pet is the pr in a well-measured region, and ee is the pr in the 


region being studied. 

This asymmetry is sensitive to the jet energy resolution as well as 
many physics effects such as additional radiation, so it does not provide 
a direct measurement of the jet energy resolution. However, these physics 
effects can be parameterised from the Monte Carlo simulations of the pri- 


mary particles (“truth” particles). The measured asymmetry, Cig can 


then be described as a convolution of the detector asymmetry, qo 
and an asymmetry from the physics effects, a Therefore the detector 
level asymmetry can be determined by subtracting the truth asymmetry in 


quadrature 


probe,det 


Odet = 0P agmi 


This deconvolution procedure can therefore be used to determine the in- 
trinsic jet energy resolution [89]. 
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Figure 11.26: Reconstructed W mass 
from W > q@ in tt events in the AT- 
LAS experiment. Good agreement is 
observed between data and simula- 
tion [90]. 


The RMS value of the asymmetry is 
measured in a sample of events. 


The asymmetry will be affected because 
radiation results in extra jets and the 
two leading jets will not necessarily 
have equal values of pr. 


The symbol ‘©’ represents a subtraction 
in quadrature (in a similar way to ‘° 
that represents addition in quadrature). 
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Key lessons from this chapter 


e In particle physics calorimeters incoming particles deposit their energy. The signal 
generated in the calorimeter is a function of this energy. 


e Calorimeters are the only detectors capable of efficiently detecting neutral particles. 


e The energy is absorbed in the calorimeter in detection quanta (ionisation charges, 
scintillation photons, phonons, etc.). The resolution of the energy measurement is 
driven by the counting statistics of these quanta. In the simplest case the fluctuations 
can be described by Poisson statistics, but correlations due to overall energy conserva- 
tion can reduce fluctuations by the Fano factor, whereas other instrumental effects can 
increase the error on the energy measurement. 


e For Poisson-distributed fluctuations, the energy resolution will scale with the number 
of quanta N as VN, and thus the relative energy resolution og /E « E —!/2. The en- 
ergy resolution will be better for larger N, i.e. for smaller energy per quantum. 


e Thus the best energy resolution can be achieved if the quanta are phonons or Cooper 
pairs. However, these can only be detected at sub-K temperatures. The next best en- 
ergy resolution can be achieved with semiconductors, and scintillation, requiring the 
most energy per quantum, gives the worst intrinsic energy resolution. 


e In high energy calorimeters, the energy deposition of the incoming particle will not be 
local, but gradual in the form of a cascade (shower) of secondary particles. Two main 
types of showers can be distinguished by the dominating interactions responsible for 
the shower development: electromagnetic and hadronic showers. 


e These cascades have characteristic scale lengths describing their dimensions. In 
electromagnetic calorimeters the longitudinal scale length is the radiation length, 
whereas the transverse dimension is driven by multiple scattering and therefore can 
be described by the Moliere radius. In hadronic calorimeters the longitudinal and 
transverse dimensions can be described using the hadronic interaction length. 


e While for low Z the radiation length and the interaction length are comparable, the 
interaction length is much larger than the radiation length for the materials used in 
typical particle physics calorimeters. 


e To maintain containment of the shower in the calorimeter, the size of the calorimeter 
must scale with log E. 


e As the energy degrades in the development of the shower, the interactions of the par- 
ticles in the shower with the detector material change. Typically, most of the energy 
is deposited in the latter stages of the shower development. Fluctuations in the early 
shower development can have a large effect on the shower dimensions and energy 
reconstruction of individual showers. 


Calorimetry 
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e A calorimeter that is made of one uniform material used to absorb the particle and to 


create the detection signal is called a ‘homogeneous calorimeter’. To reduce the size 
of a calorimeter often dense layers of absorber are interleaved with less dense active 
layers of detector that create the signal. Such a configuration is called a “sandwich 
calorimeter’. In the latter case the energy resolution is degraded by additional sam- 
pling fluctuations of the energy between energy deposited in the passive absorber and 
active detector layers. 


Hadronic showers are more complex, and in addition to an electromagnetic shower 
component, part of the energy goes into nuclear reactions. This energy fraction usu- 
ally is lost for detection. Thus, the energy recorded in the calorimeter will typically 
be different for different particles, and because of the fluctuations in the energy frac- 
tion in the different components the energy resolution will be significantly degraded, 
while the energy response will be non-linear. Hadronic calorimeters that achieve a 
similar response for different types of incoming particles can alleviate these effects. 
These are called “compensated calorimeters’. 


Calibration of energy scale and linearity are a necessity for all calorimeters. The cal- 
ibration requires electronic systems to convert raw ADC counts to charge or current. 
The conversion from charge or current to deposited energy has to be determined in 

a test beam using a prototype or slice of the full calorimeter. Different systems are 
required to transfer the energy scale to all cells in a large calorimeter. In collider ex- 
periments the knowledge of the energy scale, linearity and resolution can be improved 
with analysis of in-situ data. 


EXERCISES 


1. Rossi’s approximation B. 


A simple model of the initial stages of an electromagnetic cascade in matter assumes that photons 
travel one radiation length Xo before undergoing pair production and electrons (positrons) also 
travel one Xo before undergoing bremsstrahlung. The model also assumes that the energy is shared 
equally between the two final state particles. In this model further multiplication stops once the 
energy of the charged particles becomes less than Ec, and the charged particles lose energy to ioni- 
sation at a constant rate dE /dx. (This model is known as “Rossi’s approximation B” [432].) 


a) 


b) 


Use this model to show that the shower maximum for an electromagnetic cascade from an 
incident electron of energy E will occur at In(E/E,)/1n2. Estimate the range of charged 
and neutral secondaries after multiplication stops. Compare your estimates for an electron 
of 10 GeV with the results from the detailed simulations in Figure 11.3. 


Show that the number of particles in a shower in this model is E/Ee. 


What difference is there between the energy loss as a function of distance for elec- 
trons/positrons and photons (which is not modelled properly by the assumptions above? 
What consequence will this have for the energy depositions in showers started by elec- 
trons/positrons compared to showers started by photons? 
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d) What other aspects of the development of the electromagnetic cascade are not modelled 
by this simple model? Explain qualitatively the difference in the shower shape beyond the 
shower maximum in the simple model and the detailed simulation. 


2. Pointing accuracy and reconstruction of resonances. 


Consider the decay of the Higgs boson H — yy. The invariant mass of the two photons is M. 
Show that the resolution in the measurement of M (AM) is given by 


AM? AE? AE} (sina(Aa) \? 

M RPR ea 
where E; (E>) is the energy of the first (second) photon and @ is the space angle between the direc- 
tions of the two photons. The uncertainties in the measurements of E1, E2 and @ are AE) ,AE7,AQ@ 

AE? AE \2 

er © (EF): 
Consider such decays for which the polar angle of both photons is 0 = 45° and the difference in 
azimuthal angle between the two photons is Ad = 180°. If the energy resolution of an electromag- 
netic calorimeter is 1% for photons estimate the angular resolution for photons that would give the 
same contribution to the invariant mass resolution as that from the energy resolution. 


respectively. You may assume that 


Explain one method that could be used in an LHC detector to make such a precise angle measure- 
ment for photons. 
Electromagnetic calorimeter resolution. 


An empirical parameterisation for the energy resolution of an electromagnetic sampling calorime- 


ter is given by [221] 

OE = 00 dabs ° det =A 

E VE X x 
© scales with VZ, 0.6 < œ < 0.7 and 0.15 < B < 0.3. Consider a sampling electromagnetic 
calorimeter consisting of absorber plates of thickness 1 Xo. Using this equation, how does the res- 


olution scale with the value of Z for the absorber? What other factors are relevant for the choice of 
absorber material for an electromagnetic calorimeter? 


. Pion production threshold. 


Calculate the threshold energy in the lab frame for 2* production in the reaction m= p > m= mtn. 


. Neutron scattering. 


Show that a neutron with energy E scattering off a nucleus with atomic number A at an angle 0* 
in the CMS will transfer an energy to the nucleus (you may assume that the neutron is non- 
relativistic) 

2A(1 —cos 0*) 


es el 


Hence verify eq. (11.20). 


. Dual readout calorimeter. 


Consider a dual readout calorimeter. For each readout independently, the calorimeter is non- 
compensating, i.e. e/h 4 1. The two systems are calibrated with electrons. Stating any assumptions 
that you make, show that the energy of a hadronic shower can be determined independently of the 
fraction of electromagnetic energy fem and comment on the significance of your result. 
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7. A small Monte Carlo. 


a) To simulate random events from a Probability Distribution Function, f(x), we first define the 
cumulative distribution as F(x) = [tS f(x’)dx’. If the cumulative distribution is normalised 
to unity, then if we select random numbers y; from a uniform distribution between 0 and 1, 
we can obtain random numbers distributed according to the PDF f(x) by inverting the cu- 
mulative distribution, x; = F~! (y;). Use this method to generate random numbers from an 
exponential distribution f(x) = exp(—x/5) for 0 < x < æ. Plot a histogram of the generated 
numbers. Check that the mean of the simulated numbers is consistent with an analytic calcu- 
lation. 


b) Consider particles (A) of mass 5 GeV/c? produced with a momentum p = 5 GeV/c. The life- 
time of the meson is T = 1 ps. The particles decays isotropically in the particle rest frame to 
two particles (B) of mass 1 GeV/c”. These B particles have the same lifetime as A. Write a 
Monte Carlo code to simulate the decay chain and use this to plot the distribution of distances 
from the location of the decays of the B particles from the location of the production of the A 


particles. 


wm 


2 Particle identification 


In this chapter we will look at how we can use the detector techniques 
we have studied in previous chapters to identify different types of parti- 
cles. Knowing the type of particle can unlock a powerful unambiguous 
constraint for the kinematics of a particle, as this defines its mass. Fur- 
thermore, it defines the flavour, which can be used as a tag to identify the 
underlying physics process in an event. 

We will start this chapter with discussing four types of dedicated de- 
tector techniques that can be used on their own to distinguish between 
different types of particles: Time-of-flight (TOF), dE/dx, Cherenkov de- 
tectors and transition radiation detectors (TRDs). In the second half of 
the chapter we will then look at techniques to identify specific types of 
particles, which will usually require the use of the full power of a multi- 
detector particle physics experiment. 


12.1 PRINCIPLES OF PARTICLE IDENTIFICATION 
DETECTORS 


An unambiguous determination of the type of a particle is obtained if its 


mass can be measured. The mass is not directly accessible, but can be de- The two detectable particles with — 
masses that are too close to be easily 


termined, if the momentum is known, from a measurement of the velocity, directly resolved are the pion and the 
muon. Here the fact that the muon is a 
long-lived heavy lepton can be used for 


mc 1 1— B? _ 1 a2 D identification (see section 12.8). 


p By p? pol 


Several techniques do exist (measurement of time-of-flight, dE/dx, 
Cherenkov and transition radiation) and, as we will discuss, each of them 
performs best in a different regime of momentum of the incoming particle. 
The figure of merit is the separation power, which describes the discrimi- 
nation power between two particle hypotheses, and is given in units of the 


detector resolution o by The resolution for the two measure- 
ments are not necessarily the same. If 


they are not, a suitable average can be 
R | Gaa Ro used. 
ni2 = , 
(oy 


where the Rj 2 are the average responses of the detector for the two types 
of particles. 


12.2 MEASUREMENT OF TIME-OF-FLIGHT 


From the time-of-flight (TOF) ¢ of a charged particle measured between 
two thin detector layers separated by a distance L the velocity can be de- 
termined from B = L/(tc), which can then be used in eq. (12.1). The reso-  A/ernatively; the start signal for the 


7 5 r % time-of-flight measurement can come 
lution of the mass measurement is then given by (see exercise 1) from other arrival time constraints, for 


example from the bunch times of the 
2 
e r 
= rY + 
m p L t 


) i beam producing the charged particle, if 
DOI: 10.1201/9781003287674-12 242 


(12.2) they are known with sufficient precision. 
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Usually, in particle physics we are dealing with relativistic particles, so 
Y > 1 and thus it is the determination of the velocity that is the dominating 
contribution. For a typical separation of a few metres o,/L œ 107° is quite 
achievable, so that the time resolution becomes the limiting factor for the 
resolution. 

The TOF separation power between two particles of type | and 2 for 
the same momentum p is 


1 1\  L (Ei — E2) Le 
Bi B) peo, | ~~~ 2pro, 


Hence, the length of the TOF system has to grow as p? to maintain 
the same separation power if the momentum increases. For L = 1 m 
and ©; = 100 ps, 30 2/K separation is achieved up to a momentum of 
967 MeV/c. 

The most widely used TOF systems rely on plastic scintillators read 
out with PMTs, as they provide good timing resolution at low cost. In such 
a system, the timing resolution is the result of several contributions 


ti —to L 
O; 7 CO; 


(12.3) 


n2 = 


Cy 

P 2 

O = {/—— +04, 12.4 
' Neff el ( ) 


where Oph comprises errors for the detection of individual photons (photon 
emission delay, path length variations, transit time variations in the PMT, 
etc.), reduced by the effective number of detected photons, Ne, and Oel 
contains contributions from the electronics (noise, clock jitter, time walk, 
calibration etc.). 

The effective number of detected photons, Neff, is smaller than the 
number of photons created in the scintillator due to acceptance, attenu- 
ation and detection efficiencies. An empirical estimate is that approxi- 
mately 2 x 107? photoelectrons will be produced by a primary photon. For 
a minimum-ionising particle crossing | cm of plastic scintillator this gives 
Nege ~ 40. 

Alternative photon detectors can be MCPs (see section 4.4), or 
LGADs or SiPMs (see section 4.6). Gaseous detectors can also be used 
as timing detectors, if the gas layers are thin, and the timing resolution can 
be further improved by stacking several layers, like in multi-gap timing 
RPCs (see section 7.2). In these systems the timing resolution is given by 
equivalent expressions as in eq. (12.4). 
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For L = 3 m, © /t = 107? requires 
© = 100 ps. 


Using (for p > mc) 

E= ype ni 
4/1- me /p? 
~ pe(1 -m2 /2p°). 


= pe 


See section 5.1. 


Position resolution requirements in TOF 
systems are usually moderate. 


In large TOF systems with long scintil- 
lators photon path length variations can 
be reduced by using the average time of 
two PMTs recording the light output at 
the two ends of the scintillator. 


Assuming (dE/dx) mip = 2 MeV/c’, 
p = 1 g/cm’, and 25 eV required to 
produce a photon. 


Figure 12.1: Separation in the ALICE 
TOF system (multigap RPCs with a 
system time resolution of around 80 ps) 
[280] (original data from [37]). 
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12.3 MEASUREMENT OF dE/dx 


In section 2.3 we have seen that the ionisation energy loss of charged par- 
ticles can be parameterised as a function of By, independently of the type 
of particle. If the amount of energy deposited in a detection layer can be 


measured, then this can be used for a velocity measurement, which can be 


again combined with a measurement of the momentum in a spectrometer 
to determine the particle mass. 

Due to the shape of the energy loss function, separation is relatively 
easy for momenta smaller than for minimum-ionising particles, but then 
becomes very challenging for MIPs. It becomes possible again for mo- 
menta in the range of the relativistic rise until it deteriorates again for the 
Fermi plateau. 


3 Ť 


rrr T 10 


mrar T 


electrons : 1 
= — pions 


0 boil ul raru 0 poh Ne al 
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The challenge for the measurement of the deposited energy by ionisa- 
tion is the straggling of the energy loss. In particular for gaseous detectors 
the measurement of the energy loss is severely hampered by the fluctu- 
ations of the energy transferred in individual ionisation clusters, and the 
significant probability for very high energy transfers (6-electrons) leads to 
pronounced tails in the straggling distributions. 

In practice the energy measurement in a gaseous detector is divided 
into a measurement of the ionisation charge deposited in a number of cells 
of the gaseous detector, with the charge in each cell collected by one read- 
out channel. Averaging these measurements can reduce the error on the 
energy measurement. A common approach is to exclude a fixed fraction 
(typically about 30%) of the energy measurements with the highest energy 
depositions, to get a result which is less affected by the high energy depo- 
sition fluctuations (‘truncated mean’). For a fixed length detector there is 
an optimum number of energy measurements, as the increased straggling 
when the detection volume is divided into smaller cells negates the advan- 
tage of the increased statistics of individual measurements. 

A likelihood fit approach should in principle be mathematically more 
powerful than the truncated mean, as it allows to keep all data points. This 
is particularly relevant if only a small number of samples are taken, for 
example in silicon detector systems. However, this method is significantly 
more elaborate, and there is no evidence that the results are significantly 
better. 

An alternative approach is not to rely on the measurement of total 
charge deposited and registered in the detector, but to count the number 
of ionisation clusters, which eliminates the direct effects of cluster size 
and gas amplification fluctuations [491]. The challenge here is the high 
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Only in gaseous detectors the relativis- 
tic rise is large enough (typically 50% 
to 60%) to allow for separation above 
the MIP momentum. In solids the rise 
is limited by the density effect to about 
10% (see section 2.3). 


Figure 12.2: Left: Average energy loss 
as a function of momentum for different 
particles in Ar/CH4 93/7 (using the 
energy loss function shown in Fig- 

ure 2.16). Right: Resolution power for 
Og /E ~ 5% for selected discrimination 
channels. 


See section 2.5. 


The charge collection and amplification 
process required in gaseous detectors 
will introduce additional stochastic 
fluctuations and possible non-linearities 
for the conversion of energy into an 
electronic signal. 


Sometimes also the lowest 10% of the 
energy measurements are discarded, 
to make the distribution even more 
symmetric. 


Another way to reduce straggling is to 
increase the pressure of the gas, but 
this comes at the cost of reducing the 
height of the relativistic rise due to the 
increased density effect. 


A compromise is a single parameter 
likelihood, where the shape of the sam- 
pled energy distribution is assumed to 
be universal. However, this assump- 
tion does not apply for thin sampling 
layers [48]. 
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bandwidth (order 100 MHz) required to resolve individual ionisation clus- 


ters [183]. 
In practice, with the truncated mean approach a resolution of 
o£ /E ~ 5% is typically achieved. 
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12.4 CHERENKOV DETECTORS 


A more powerful particle identification technique at higher momentum 
relies on the measurement of the photons emitted by the Cherenkov effect 
in a transparent radiator. 

As discussed in section 2.8, the number of Cherenkov photons emit- 
ted per unit of track length of the incoming particle is not large, and the 
number of photons detected will be even smaller, due to attenuation, ac- 
ceptance and detection efficiencies. Maximising the number of detected 
photons is thus a central goal of the design of Cherenkov detectors, in par- 
ticular if the emission angle cnp is to be determined and a large number of 
measurements is needed to reduce the errors associated with the difficult 
identification of the direction of the photons. 

Cherenkov detectors consist of at least two parts, a radiator, in which a 
sufficient number of photons is emitted, and a photon detector, where they 
are detected. In many cases optical elements (mirrors or other optical in- 
terfaces) are placed in between, which allow to move the detectors outside 
of the active volume of the detector and/or focus photons from one track 
into the same region of the photon detector. 


RADIATORS 


Radiators need to be transparent for photons with wavelengths for which 
the photo-detector is sensitive (see Figure 7.30). If the Cherenkov angle 
is measured they also should have a small dispersion in the energy range 
of detected photons. For observation of Cherenkov radiation a radiator 
material must be chosen so that the Cherenkov threshold condition Bn > 1 
is met for the velocities of particles in the experiment. The Cherenkov 
angle then increases with f, until it reaches a maximum angle of Omax = 
arccos(n—!). 

We have seen in section 2.8 that the density of photons emitted per 
energy and per unit of track length is proportional to sin? Och. Hence, the 
number of photons increases for larger Bn. For relativistic particles this 
means that this number is larger for optically denser materials, typically 
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Figure 12.3: Specific energy loss 
(dE/dx) in the ALICE TPC versus 
particle momentum in Pb—-Pb collisions 
at \/Snn = 2.76 TeV. The lines show the 
parameterisations of the expected mean 
energy loss [41]. 


For a detailed review of Cherenkov 
detectors see [386]. 


Due to the absence of a continuous 
energy deposition along the path of 

a photon, its direction can only be 
deduced from the point of emission and 
the point of detection of the photon. As 
detection destroys the photon, the point 
of creation cannot be directly measured 
and can only be inferred from other 
constraints. 


Photon emission from scintillation or 
fluorescence should be small. 


246 


solids or liquids. At the same time the Cherenkov angle will be larger for 
such materials, so that Cherenkov detectors made of such materials will 
typically be more compact than for materials with smaller index of refrac- 
tion, where greater depth of the radiator and distance for the separation of 
the photons is required. 

The maximum number of emitted photons is obtained for the max- 
imum Cherenkov angle and can be found from eq. (2.47), assuming no 
dispersion (n constant), 


dN. Za 1 
( a = = (Emax Emin) (1 = =) . 


For small angles sin Och ~ Ocn, and the turn-on is given by 


dNy/dx On Po Yh 
(dNy/dx) 0 OF 7 ? 


where %3 = (1 —1/n”)~!/?, and we have assumed that y > 1. 

Widely used solid radiator materials are transparent crystals like quartz 
glass (fused silica, SiOz), LiF or CaF . For liquid and gaseous radiators 
fluorocarbons are a common choice due to their low dispersion (C¢Fi4 is a 
liquid at STP, whereas CF4, C2F6 or C3Fg are gaseous). Large Cherenkov 
detectors in neutrino physics use water as a radiator. For the region of n 
between gases and solids/liquids, aerogel can be used. 


Quartz glass 


_ oon aerogel 
10° E E 
[5e o pem 
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photon energy [eV] 


In layers of optically dense materials with a downstream surface per- 
pendicular to the incoming track and backed by a gas (air), the Cherenkov 
angle is typically so large that the radiation is internally reflected on the 
surface of the radiator. One way to avoid this is to give the radiator a ‘saw- 
tooth’ shape [72]. Alternatively, multiple internal reflections in a planar 
radiator can be used to guide the photons to a photon detector outside the 
acceptance of the detector (this principle is called ‘detection of internally 
reflected Cherenkov’ radiation — DIRC) [16]. To allow for the multiple 
reflections a high-n material with low photon attenuation and highly pol- 
ished surfaces has to be used. 

The extended photon path due to the multiple reflections can help to 
correct for the chromatic dispersion. In a dispersive medium the phase and 
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The Cherenkov angles are small in 
radiators with low indices of refraction 
(gases). 


One drawback of these is that they are 
potent greenhouse gases. 


Aerogel is a synthetic porous ultralight 
material, most often made of Silica 
(SiO2). 


Figure 12.4: Index of refraction 

for some radiator materials (data 

from [480, 250]). The index of refrac- 
tion for aerogel can be tuned between 
1.008 and 1.1, depending on the density 
of the material. 
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Figure 12.5: Principle of a ‘sawtooth’ 
radiator for the CLEO III RICH [72]. 
Top: planar radiator, bottom: ‘saw- 
tooth’ radiator. 
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PMT + Base __\ 


Figure 12.6: Schematic of the fused 
10,752 PMTs S4 al of the fi 


silica radiator bar and imaging region 
of the BaBar DIRC detector [16]. 
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group velocities are different. This can be described by two different types 
of index of refraction, one for the phase velocity, ng, and one for the group 
velocity, ng. The former defines the Cherenkov angle, whereas the latter 
determines the propagation velocity of the photon in the medium. They 


d 
3 
T 

1 


in 

a 
T 
1 


Nerol SiO) 
crouel SIO, 


tn 

3 
T 
f 


= ee 
— os a Nprase(SO,) 


refractive index n 


j 
| 
| 
| 


are related by 140 F l | 
1 2 3 4 5 
E_photon [eV] 
(6 dk d(ck) d(@n ) dn Figure 12.7: Dispersion curves in 
Ng c ? =No +E aiy ; quartz glass for the phase velocity (ng) 
Cg da da da dE and for the group velocity (ng) [280]. 


where F is the energy of the photon. Thus a concurrent measurement of 

the Cherenkov angle and the time-of-flight of the photon can be used to a sia ear this 
determine the wavelength of the photon. Examples for DIRC-type detec- l 
tors using timing information to improve particle identifications are iTOP 

at Belle II [245], and TORCH, proposed for the LHCb detector [135]. 


PHOTON DETECTION 


In principle, all detector technologies sensitive to optical (few eV) photons 
discussed in previous sections of this book can be used as photon detectors 
for Cherenkov detectors, in particular PMTs (section 4.4), silicon detec- 
tors like APDs (section 4.6), or gaseous detectors with TEA or TMAE, or 
with CsI photocathodes (section 7.6). The trend for imaging Cherenkov 
detectors requires improved segmentation. Early imaging detectors relied 
on gaseous detectors based on the time-projection principle [57, 17, 9], 
followed by gaseous detectors with cathode pad readout, enabled by in- 
tegrated readout electronics [72, 225]. In all these systems the photon de- 
tector was in the path of the incoming charged particle and material in the 
photon detector therefore needed to be kept low. To contain the detector 


gas, windows transparent in the relevant wavelengths are required. These can be made of quartz glass for 
z A TMAE- and Csl-based and must be 
In fixed-target experiments and DIRC-type detectors the detection made of CaF» for TEA-based gaseous 
elements can usually be placed outside the detector acceptance, so that detectors: 


material is less of a concern. The inclusion of photon time-of-flight mea- 
surements introduces new demands on the time resolution of the photon 
detectors, and the detectors considered for next-generation RICH detectors 
are MCPs (section 4.4) and SiPMs (section 4.6). 
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THRESHOLD CHERENKOV DETECTORS 


The simplest type of Cherenkov detector just detects the presence of 
Cherenkov radiation. This can be particularly useful for a hadronic beam, 
where particle momenta have been selected in a suitable magnet system. 
In this case the momentum is known and singular, and a Cherenkov detec- 
tor with a radiator with a suitably chosen index of refraction can provide 
discrimination of the flavour of the particles in the beam. 

A rough estimate of the incoming particle’s velocity can be obtained 
from measuring the magnitude of the light signal, as the number of emit- 
ted photons is proportional to sin? Och. 


RING IMAGING CHERENKOV DETECTORS 


A direct measurement of the velocity can be achieved if the angle of in- 
dividual Cherenkov photons can be measured. Typically, this will involve 
the measurement of as many photons as possible to reduce the error on the 
angle measurement for individual photons. Depending on the geometry 
and optics this will often, but not necessarily, result in a circular or ellipti- 
cal detection pattern. 

From the relation for the Cherenkov angle cos @ch = (nB)~! it fol- 
lows that dB /d@c, = £ tan Ocn, and thus for the resolution of the velocity 
measurement with a single angle measurement 

Op 
— = tan Och Obon: 


B 


Contributions to the angle measurement error 6g,,, come from chro- 
matic dispersion, the photon position detection error (usually dominated 
by the detector segmentation), uncertainties in the knowledge of the emis- 
sion point and imperfections of optical surfaces used to reflect the pho- 
tons. Most of these errors are uncorrelated for different photons and as in 
state-of-the-art imaging Cherenkov detectors typically a few tens of pho- 
tons are detected for each charged particle, the single-photon resolution 
can be (assuming that the angle measurement errors are the same for all 
photons) divided by VN, with N the number of detected photons. Typical 
event angle measurement errors Ogo / VN are between 0.1 and 5 mrad. 

In an imaging Cherenkov detector the signal is given by the number 
of photons on the ring or similar pattern and thus the separation power is 
given by 


O Asin? Och B (mî — m3) e 
ba Osin? oa VN p 2k, p? 
with 
k= n sin Och cos Och : 
where O n2 Och and Og, are the measurement errors for a single photon 


for sin? Oca and Och, respectively. The factor k, is often approximated for 
relativistic particles (B œ 1), for example as k; ~ tan Och O¢q, / VN [510], 
or as k ~ Vn? — 1 Oec, / VN [505]. 

Two main geometries are used to achieve an image of the photons on 
the detector planes. 
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For a suitable choice of the index of re- 


fraction, a Cherenkov signal is evidence 
for a pion, while the absence of a signal 


would indicate a kaon. 


Reconstruction of the paths of several 
photons can be used to reconstruct 

the Cherenkov cone and thus supply 
information about the track parameters 
of the incoming particle. 


The distributions for these errors usu- 
ally do not have long tails (as opposed 
to charged particle dE /dx). 


g 
1 2 
sin? Och = 1— a= (=) 
n pn 


and 


O2 


in OG, 2 sin Bcn cos Och Ooch: 


k, is often called the ‘RICH detector 
constant’. 


For an angle measurement error of 

2 mrad and a radiator with n = 1.5, 

k, ~ 2.2 x 10-3 and the n-K separation 
power at 4 GeV is about 3.1. 


Typically, but not necessarily the images 
are rings. 
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Proximity focusing Cherenkov detectors 


In this geometry the photons are emitted over a limited distance of the 
path of the primary charged particle. Typically, this uses a limited thick- 
ness of solid or liquid radiator, to yield a large enough number of pho- 
tons. The limited depth of the radiator provides a constraint on the point of 
emission for the photons. For a radiator depth d and a much larger prox- 
imity gap / between the radiator and the detector the error on the angle 
measurement is given by 


~Ia 


A special case of proximity focusing are large volume neutrino ex- 
periments, where the medium is typically water. Here the limitation of 
the emission points is due to the limited distance (compared to the size of 
the tank) the leptons created in charged-current interactions travel before 
their energy is used up and they come to a stop. As there is no informa- 
tion on the track parameters of the incoming particle, the direction must be 
inferred from the shape of the image. Photon arrival times due to time-of- 
flight can support the determination of the direction of the lepton. Identifi- 
cation of the lepton type can be obtained from the sharpness of the image, 
as electrons initiate electromagnetic showers, with charged secondaries 
slightly off-axis from the originating electron, and thus the Cherenkov im- 
age is more blurred than for muons. 


Mirror-focusing Cherenkov detectors 


Alternatively, the photons can be focused by concave mirrors, with the de- 
tectors in the focusing plane of the mirror. Parallel photons are focused to 
the same detector location, and the Cherenkov cone onto a ring. In prac- 
tical applications, the mirrors are segmented to allow for planar detector 
geometries, and to limit the depth of the Cherenkov detector system. 

One advantage of the mirror-focused system is that there is no funda- 
mental limit on the depth of the radiator, and thus this geometry is com- 
patible with low-density radiators. 

To optimise separation performance over a wide momentum range 
mirror-focusing RICH systems often comprise more than one radiator ma- 
terial. An example of a combined proximity- and mirror-focusing RICH 
system has been the DELPHI RICH, which used a liquid CF14 radiator 
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Figure 12.8: Schematics of a proximity 
focusing Cherenkov detector. 


Low energy muons will have a very 
short range compared to the dimensions 
of the detector and will produce sharp 
rings as shown in the figure. Higher 
energy muons can have a significant 
range (for example, 2 GeV muons have 
a range of ~ 10 m) and therefore the 
inside of the ring will tend to be filled in 
by Cherenkov radiation from the muon 
as it propagates through the detector 
towards the detection surface. However, 
there will still be a sharp outer edge 
and the pattern can still be efficiently 
distinguished from that caused by 
electron showers. 


Figure 12.9: Example event displays 
from the Super-Kamiokande detec- 

tor [297] (© Deutsche Physikalische 
Gesellschaft. Reproduced by permission 
of IOP Publishing. CC BY-NC-SA). 
Left: electron event, right: muon event. 
The event displays show the unwrapped 
instrumented surface (11,000 photomul- 
tiplier tubes) of the cylindrical water 
tank (height and diameter ~ 40 m). 


Hence the acronym RICH, for ‘Ring 
Imaging Cherenkov’ detector. 


This is true for parabolic mirrors, for 
spherical mirrors the focus will not 

be in a point, and variations in the 
emission point will result in an error on 
the angle measurement. 
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in a proximity focusing geometry and a C5F12 gaseous radiator with mir- 
ror focusing. The photons were detected in a CH4/TMAE gaseous drift 
chamber with a long (1.6 m) drift. 
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The LHCb RICH system consists of two detectors, RICH1 and 
RICH2. In both detectors a mirror system consisting of a spherical mir- 
ror and a plane mirror deflects photons out of the acceptance for charged 
particles and focuses them onto a photon detector system consisting of hy- 
brid photodiodes (HPDs). RICH1 uses aerogel (only in run 1) and C4F10 
as radiators, CF4 is used in RICH2. 
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12.5 TRANSITION RADIATION DETECTORS 


As discussed in section 2.9, transition radiation in the X-ray regime 
has an intensity that is proportional to y, and thus a measurement of 
this intensity, together with a measurement of the momentum, can be 
used to determine the mass of the charged particle. It is evident that this 
works best for ultra-relativistic (y >> 1) particles. Up to high momenta 
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Figure 12.10: Schematics of a mirror- 
focusing Cherenkov detector. The 
detector plane can also be outside of 
the acceptance for incoming charged 
particle tracks. 


At collider experiments RICH detectors 
usually have a mirror tilt to project the 
image outside of the detector, which 
introduces non-perfect focusing that 
results in chromatic aberration. 


Figure 12.11: DELPHI barrel RICH. 
Left: Quadrant view (modified 

from [69]). Right: Photoelectron scatter 
plot of 200 superimposed events in a 
prototype (beam at 18° relative to the 
normal at z = 142 cm) [69]. The large 
parabolic pattern is from the liquid ra- 
diator, the small circle from the gaseous 
radiator. Other features are from quartz 
windows of the photo-sensitive region. 
The ionisation electrons are drifted 
along the length of the detector to 

the end on the right, where they are 
detected by a multi-wire detector. 


The aerogel radiator was removed 
after run I as its ability to provide 

PID below the Cherenkov threshold 
for kaons for the C4F\o radiator was 
limited by the challenges to reconstruct 
the photons in the high track density 
environment, while its removal improves 
the performance of the gas radiator, as 
it removes the obstruction for photons 
upstream of the aerogel radiator [397]. 
In 2022 the HPDs have been replaced 
with multi-anode PMTs. 


Figure 12.12: LHCb RICHI [332]. 
Left: Side view cross-section. Right: 
Cherenkov angle versus particle mo- 
mentum for the LHCb RICH radiators. 
Indices for refraction are n = 1.03 for 
aerogel, n = 1.0014 for C4Fio, and 

n = 1.0005 for CF, (in RICH2, not 
shown). 


For good summaries of transition radia- 
tion detectors see [59, 505]. 


Particle identification 


(~100 GeV/c) only e™ will emit significant TR intensity, so that the typ- 
ical application of TR detection is to distinguish e™ from other charged 
particles in that energy regime. The commonly used figure-of-merit for a 
transition radiation detector (TRD) is the efficiency for mis-tagging a pion 
as an electron for a given electron identification rate (typically 90%). 


RADIATORS FOR TRANSITION RADIATION DETECTORS 


The number of photons emitted at a single interface is @(z7a), where ze 
is the charge of the incoming particle, and thus a large number of inter- 
faces is needed to obtain a detectable signal. At the same time, attenuation 
needs to be limited. The dominant photo-absorption mechanism for pho- 
tons with about 10 keV is by the photo-electric effect, with an absorption 
length that scales like Z~” with m between 3 and 5, whereas the radiated 
energy increases only with @, ~% vZ. Thus, it is necessary to use a radia- 
tor material with a low Z. In addition, the material should be structurally 
stable, so that the interface surfaces are maintained. Early TRDs used foils 
from low-Z metals like lithium or beryllium. However, these metals are 
difficult to handle and more recent TRDs use polypropylene, (C3H6)n, or 
Mylar. These materials can be used as foils, foams or fibre mats. 


PHOTON DETECTION FOR TRANSITION RADIATION 


The typical energy of photons that escape a radiator stack is a few 10 keV, 
as the lower energy photons are more strongly absorbed in the radiator 
material. The two challenges for the photon detector in TRDs are the low 
interaction cross-section for photons with this energy, and the small emis- 
sion angle of the TR, which usually makes it impossible to separate the 
charged particle energy loss of the incoming particle from the energy de- 
position of TR photons. The energy deposited by TR photons is typically 
significantly larger than the local average charged particle energy loss, but 
energy transfers in the tail of the straggling distribution (6-electrons) can 
create signals of similar size, and thus fake a TR photon. 

To maximise TR photon absorption the detector material should have a 
high Z, and the most common choice of photon detectors are gaseous de- 
tectors with xenon as the main gas component. As usual, the noble gas 
xenon needs to be combined with a suitable quencher gas like CH4 or 
CO2. Two approaches can be taken for the design of a TRD. Either, the 
detector consists of thin elements, which allows for a fast response in a 
high-rate experiment, or it is deep, which can provide better segmenta- 
tion beneficial in experiments with high track densities. In both cases it is 
common to include the position information for the charged particle in the 
general tracking of the experiment. 

The first approach was taken in the ATLAS TRT, where the detection 
elements are straw tubes made of polyimide with a diameter of 4 mm, op- 
erated with Xe/CO2/O> 70/27/3. There are 52,544 straws in the barrel, and 
122,880 in the endcaps. The straw tubes are embedded in the radiator that 
is made of polypropylene-polyethylene fiber mats. The absorption length 
for the lowest energy photons of interest (5 keV) in the radiator material 
is about 17 mm. Hits are identified with two different thresholds, a low 
threshold used for tracking (at ~15% of the average signal expected for 
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TR has also been used for the identifi- 
cation of pions in hadron beams above 
100 GeV/c [209, 127]. 


See section 2.1. 


Z is the atomic number of the material. 


Lithium is flammable, and beryllium is 
toxic. 


Mylar is a trade name for a polyester 
film made from stretched polyethylene 
terephthalate (PET). 


The spectrum tends to be harder for 
thicker stacks. 


Xenon gas is also the detector material 
with the highest relativistic rise of the 
charged particle energy loss (~75%), 
which further increases the signal from 
charged particle loss for a particle with 
a large relativistic y-factor. 
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a minimum-ionising particle, about 250 eV), and a high threshold for the 
identification of TR photons (at about 6 keV). 


Radiator Sheets 
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Figure 12.13: ATLAS TRT. Left: Isometric view of a barrel module [103]. Right: Probability of a TRT high-threshold hit for pions and electrons as 
a function of y in 7 TeV collision events [142]. For the identification of particles the information from the whole TRT, comprising a large number of 
straws (@(30)), is combined. The efficiency for electron identification of the TRT depends on the threshold of TRT hits required, but this is usually set 


to achieve ~90% efficiency for electron identification. 


The second approach has been taken by the ALICE collaboration. The 
ALICE TRD consists of a radiator comprising a carbon-fibre-laminated 
Rohacell layer and polypropylene fibre mats with a total thickness of 
48 mm, and a gaseous detector filled with Xe/COz 85/15, which con- 
sists of a drift section of 30 mm thickness, and a 7 mm thick multi-wire 
proportional chamber section with pad readout. In this detector particles 
with y > 1000 will produce about 1.45 X-ray photons in the energy range 
of 1-30 keV from TR. The highest probability to be absorbed in the gas 
occurs when the photons enter the active volume (corresponding to the 
largest drift time), decaying exponentially on a scale given by the absorp- 
tion length. 
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For the identification of TR photons two different strategies can again 
be used [227]. Either the total charge deposited in the detector for the 
passage of the charged particle (dE /dx+TR) is measured after ampli- 
fication by a charge sensitive amplifier and digitisation in an ADC (‘Q- 
method’), or the number of clusters exceeding a threshold well above the 
average charged particle energy loss is counted (‘N-method’). The sec- 
ond approach has the advantage that the number of such clusters follows a 


Rohacell® is the trade name for a 
closed-cell rigid expanded foam plastic 
based on polymethacrylimide (PMI). 


Figure 12.14: ALICE TRD [59]. Left: 
Schematic cross-sectional view of a 
detector module. Right: Average pulse 
height as a function of drift time for 
pions (triangles), electrons without a 
radiator (squares) and electrons with a 
radiator (circles), all with p = 2 GeV/c 
momentum. 
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Poisson distribution with a less prominent tail than of the straggling distri- 
bution of the energy loss of the charged particle, but it can be affected by 
space charge effects and diffusion of the ionisation charge during its drift 
in the detector gas. The two different methods require different optimisa- 
tions of the readout electronics and the detector design. 

In TRDs with gaseous detectors, the position resolution is usually in- 
sufficient to separate the photons from the charged track and thus a direct 
measurement of the emission angle is impossible. However, the better po- 
sition resolution in semiconductor detectors can be used to to resolve the 
angle, and thus to improve the PID performance (see for example [50]). 
The separation can be enhanced by a magnetic field between the radiator 
and the detector. 
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PERFORMANCE OF TRANSITION RADIATION DETECTORS 
The design choices for a TRD can be affected by many secondary con- 
siderations (speed, segmentation, performance as a tracking device, use 
in trigger systems, etc.), but, as experience shows, the design parameter 
which effectively drives the performance for its primary function, particle 
identification based on the detection of TR, is the depth of the TR detec- 
tion system. Pion fake rates below 1% for electron identification rates of 
90% have been achieved with detectors with a depth of several tens of cm. 
[ @ NOMAD Figure 12.16: Pion efficiency measured 
o KEK (or predicted) for different TRDs as a 
ore A UA2 function of the detector length for a 
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12.6 COMPLEMENTARITY AND COMBINATION OF 
PARTICLE IDENTIFICATION TECHNIQUES 


From the discussion in this chapter so far it should be clear that the differ- 
ent dedicated particle identification detectors perform differently in differ- 
ent kinematic regions. No technique works well for all particles and mo- 
menta, so that the choice of PID system depends very much on the specific 
experiment. Some of the PID detectors can also serve additional functions 
like contributions to tracking, triggering, and/or pattern recognition. 
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The different approaches are uncorrelated, and thus the combination of 
the results, if such systems are present in an experiment, can improve the 
overall PID performance. 
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12.7 ELECTRON IDENTIFICATION 


Identification of electrons in the presence of hadrons will use the very 
different shower profiles of electrons and charged hadrons that we con- 
sidered in chapter 11. Electron identification is particularly challenging 
in a hadron collider like LHC because of the very large cross-section of 
hadrons from QCD jets. We will therefore focus our discussion on elec- 
tron identification at hadron colliders, but the principles could be applied 
to other environments. 
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Figure 12.17: Approximate minimum 
detector length required to achieve a 
K/t separation of nx x = 3 with three 
different PID techniques [339]. For 
the energy loss technique a gaseous 
detector is assumed. For the TOF 
technique, the detector length represents 
the particle flight path over which the 
time-of-flight is measured. For the 
Cherenkov technique only the radiator 
thickness is shown. The thicknesses of 
an expansion gap and of the photon 
detectors have to be added. 


The ALICE experiment at the LHC 
incorporates all the PID technologies 
discussed here [42]. 


Figure 12.18: Particle identification 
by simultaneous dE /dx and TOF 
measurement in the momentum range 
5-6 GeV/c for central Pb-Pb collisions 
in the NA49 experiment [24]. 


We will focus on identifying electrons 
at high values of transverse momentum 
(pr) because of the very large hadron 
backgrounds at low pr. 


Particle identification 


The key differences between electrons and hadron showers are the 
short longitudinal shower profile and the narrow transverse shower profile 
of the former. Therefore, if we have a calorimeter with fine granularity, we 
can use the lateral and the longitudinal shower profile to provide powerful 
electron/hadron separation. 

As we saw in chapter 11 the shower profile from photons is very sim- 
ilar to that from electrons. Therefore, a calorimeter alone cannot provide 
powerful rejection against backgrounds from 7°? — yy. This type of back- 
ground can be rejected by requiring electron candidates to have a good 
matching between the direction and momentum of the track reconstructed 
in the inner tracking system, and the energy and location of the shower in 
the electromagnetic calorimeter. 

Another background arises from photon conversions in the detector 
material upstream of the calorimeter. If the photon conversion is suffi- 
ciently asymmetric the energy of either the e~ or e* can be large enough 
that the reconstructed track momentum is compatible with the measured 
calorimeter energy. This type of background can be suppressed by recon- 
structing the tracks of both the e~ and e* or by vetoing electron candi- 
dates which do not have a hit in the first layer of the tracking detector. 

In a hadron collider experiment it is often necessary to distinguish 
‘prompt’ electrons from electrons originating in semi-leptonic decays of 
heavy flavour hadrons (b and c hadrons). In general, there will be no other 
high energy particles close to a prompt electron. The electrons from heavy 
flavour decays on the other hand arise from a b or c jet and therefore will 
invariably be accompanied by high energy hadrons. We can therefore 
strongly suppress the background from electrons from heavy flavour de- 
cays by ‘isolation’ cuts. The cut will typically require a low number of 
charged hadrons and/or energy in the calorimeter close to the electron. We 
can reduce the background from photon conversions in the detector by re- 
quiring that the candidate electron track has a hit in the first layer of the 
vertex detector (see exercise 4). 

These general principles will apply to all collider experiments but the 
details will differ. We will consider electron identification in the ATLAS 
detector as an interesting example [91] because of the fine lateral and lon- 
gitudinal sampling of this electromagnetic calorimeter. 
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Additional suppression of this type of 
hadron background can be achieved 
using transition radiation detectors (see 
section 12.5). 


This provides one motivation for trying 
to minimise the material (in units of 
radiation lengths) in tracking detectors. 


In a hadron collider experiment (e.g. 
LHC) the cross-section for hadronic 
jets is many orders of magnitude larger 
than that for the production of prompt 
electrons from processes like W —> eV, 
and Z > ee. 


Figure 12.19: Schematic of the ATLAS 
tracking detectors and calorime- 

ters used for electron identification. 
The dashed track before the pre- 
sampler shows the path of a photon 
that has resulted from electron 
bremsstrahlung [91]. 


256 


The electron reconstruction and identification consists of four principle 


steps: 


1. 


Seed Cluster. The energy in the electromagnetic calorimeter in the 
different layers is summed within ‘towers’ of size An x Ad = 
0.025 x 0.025. A ‘sliding window’ algorithm is then used to find 
clusters of 3 x 5 towers in 7 and @. The sliding window algorithm 
works by summing the transverse energy (}, Er) in 3 x 5 towers and 
moving the centre of the window over the calorimeter to look for 
clusters with } Er > 2.5 GeV. 


Track matching. A reconstructed track should point to the cen- 

tre of the electron cluster within |Nctuster — Ntrack| < 0.05 and 

—0.1 < @ctuster — Ptrack < 0.05. The backgrounds from photon con- 
versions are suppressed by requiring a minimum number of hits on 
the track and by requiring a hit in the innermost pixel layer. 


Electron identification. Several calorimeter and track variables are 
used to efficiently identify genuine electrons and reject backgrounds 
from QCD jets. These will be discussed in more detail below. 


Electron isolation. Prompt electrons (e.g. from processes like 
Z — e*e~) can be separated from electrons from decays of b or c 
hadrons using energy around the electron cluster. 


There are several variables which contribute to the electron identifi- 
cation stage. Some of these variables are described to illustrate how the 
general principles of electron identification are implemented in practice. 


Longitudinal shower profile. The ratio of the transverse energy in 
the electromagnetic calorimeter cluster to the transverse energy in 
the corresponding cells of the hadronic calorimeter. 


Transverse shower profile. The energy-weighted width in the 
second layer of the electromagnetic calorimeter is defined in 
terms of the energy in cell i, E; and the pseudorapidity 7; as 


V (XiEin?) /LiBi— ((LEin') /LE,)”. A similar variable, Rọ is 
defined as the ratio of energy in a region of 3 x 3 cells over that of 
the energy in 3 x 7 cells (centred on the electron cluster). 


Track quality. The number of hits in the silicon tracker are used to 
suppress photon conversions, and the transverse impact parameter 
divided by the uncertainty, dy /o (do), to suppress electrons from 
heavy flavour decays. 


Transition radiation. The information from the transition radiation 
tracker (TRT) can be used to further suppress backgrounds from 
charged hadrons. 


Track-calorimeter matching. The precise matching of the track- 
projected impact point at the calorimeter and the calorimeter cluster 
centre-of-gravity provides rejection against all backgrounds apart 
from photon conversions. The matching between the measured 
calorimeter energy E and the momentum p of the matched track 
provides further background rejection. 
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n is a polar angle variable defined 
by eq. (2.55) and @ is the azimuthal 
angle. 


The width is larger in @ to ac- 
commodate the effects of electron 
bremsstrahlung. 


The details will change for different 
experiments. Here it will be described 
for the case of the ATLAS detector. 
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Some of these variables can be used in a simple selection algorithm. 
Other variables can provide some statistical separation between the elec- 
tron signal and background, but do not show a clean separation between 
signal and background. Therefore the analysis combines many variables 
but as there are many variables a simple series of selections on each vari- 
able (‘cuts’) will tend to be inefficient. 

A better approach for electron identification is a likelihood method!. 
The likelihood for signal electrons (background) for a given event is de- 
fined as 


Ly (®) = | [Psa sa), (12.5) 
i=] 


where the vector x represents the n variables used and Ps(p)(x;) are the 
probability distribution functions (PDF) for signal (background) for the 
individual variables evaluated at the measured value x;. The detector per- 
formance varies strongly with 7) and electron transverse energy Ey. There- 
fore, the signal and background PDFs are evaluated in bins of 7 and Er. 
The discriminant to separate electrons from backgrounds is then defined 
by 


dı =L,) (Ls +p). (12.6) 


For presentation purposes, a scaled variable is used that is defined as 
di = tIn(d,!—1). 

While no one variable provides very good discrimination between the 
electron signal and the background, the combined likelihood function does 
achieve very good separation. This is illustrated in Figure 12.20, which 
shows the distribution for signal and background for one such variable, Rọ 
and the likelihood discriminator dj. 
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A cut is then made on a particular value of d} and the electron effi- 
ciency can be traded off against background rejection. 

Finally, if we want to identify electrons from prompt processes 
(e.g. Z — ete) as opposed to semi-leptonic decays of heavy flavours, 


'This provides a significant improvement on the simple cut analysis. However, it does not 
allow for the significant correlations between the variables. An ideal algorithm would use a 
multi-dimensional likelihood, which combined all the variables. Consider the simple case of 
a likelihood based on two variables for each event, x;, y;. Ignoring the correlations between 
the two variables, the likelihood for signal and background would be L = Ps) (xi) Ps) 01). 
If we allow for the correlations between x and y using a joint probability distribution function 
P(x,y) the likelihood for signal (background) would be L = Psp) (xi, yi). However, this would 
be computationally impractical when the number of dimensions is large. An alternative 
approach is to use a machine learning algorithm like neural nets to take into account these 
correlations. 
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If sufficiently tight cuts are placed on 
many different variables in order to 
achieve a high background rejection 
then the overall efficiency will be low. 


The values used in the selection (cuts) 
are also optimised for different n and 
Ey bins. 


Figure 12.20: Distribution of electron 
signal and background for the variable 
Rg (left) and the scaled discriminating 
variable dj, (right) [91]. 


The electron selection can be adapted 
to identify photons by reversing the re- 
quirement that there is a charged track 
matching the cluster in the electromag- 
netic calorimeter. Alternatively, photons 
can be identified with clusters in the 
electromagnetic calorimeter that match 
an identified photon conversion in the 
tracking detector. 
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we can apply ‘isolation’ selections. The basic idea is to measure the trans- 
verse energy within a cone around the direction of the electron but ex- 
cluding the electron energy itself. The cone is defined in (1, @) space by 
AR = 
the edge of the cone and electron 7 (@). The isolation can be computed 
using the calorimeter transverse energy or the sum of the transverse mo- 
mentum of the charged tracks inside the cone. 


EFFICIENCY MEASUREMENT 


The efficiency for the electron identification selections can be estimated 
from Monte Carlo simulations. The efficiency varies with electron 7 and 
transverse energy Er. In order to minimise systematic uncertainties asso- 
ciated with inaccuracies in the simulation, data-driven methods are used 
to correct the simulation. This is done using a ‘tag and probe’ method, 


similar to that used for measuring tracking efficiencies for muons (see sec- 


tion 10.1). 
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12.8 MUON IDENTIFICATION 


Muons lose their energy by ionisation rather than through bremsstrahlung 
or hadronic interactions. Therefore high energy muons will only lose a 
small amount of energy in the calorimeters and we can identify muons 

by matching tracks in dedicated position detectors behind the calorime- 
ter (‘muon chambers’) to a track measured in the inner tracking detectors 
(before the calorimeter). In addition, the minimum-ionising signature of 
muons in the calorimeters can also be used as additional information on 
the path of a muon if calorimeter signals compatible with a minimum- 
ionising energy deposition can be linked to the track candidate from the 
tracker. 

Different matching strategies in terms of how the hypothesis is seeded 
and which information from other systems is included, and different 
matching criteria, are often employed concurrently. The different algo- 
rithms help to maximise coverage and achieve sensitivity in areas where 
one of the subsystems performs poorer. While these general principles are 
valid for all detector systems, the details depend on the magnetic field sys- 
tem and the tracking detectors used. 


(An)? + (A¢ġ)?, where An (A@) is the difference in n (@) between 
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Typically, the cone size is AR = 0.2. 


Figure 12.21: Data-driven mea- 
surements of electron identification 
efficiency as a function of electron 
transverse energy Er for three values 
of the discriminating variable, called 
‘loose’, ‘medium’ and ‘tight’ for the 
ATLAS experiment [91]. The bottom 
panel shows the ratio between the data- 
driven and the Monte Carlo simulation 
estimates. 


Only very high energy muons (2 1 TeV) 
have significant energy loss by 
bremsstrahlung. The cross-section 

for muon-nucleus hadronic interactions 
is negligible for practical purposes. 


The different algorithms provide a 
trade-off between efficiency for identify- 
ing genuine muons versus the probabil- 
ity of misidentifying a hadron as a muon 
and can be selected for the needs of the 
specific physics analysis using the muon 
candidates. 
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If the calorimeter is sufficiently deep in units of hadronic interac- 
tion lengths, the probability of a charged hadron not interacting in the 
calorimeter will be negligible. However, there can be particles emerg- 
ing from the hadronic calorimeter from the tail of the hadronic shower 
and these hadrons can potentially create fake muon signatures (this back- 
ground is called ‘punch-through’). 

Another source of background muons is the decay in flight of charged 
pions and kaons. The semi-leptonic decays, 7 —> LV, or K — UV, can oc- 
cur before the calorimeter and therefore result in genuine muons entering 
the calorimeter. This background can be suppressed by identifying kinks 
in the tracks in the inner tracking system. The cases in which the kink can- 
not be identified because the pion or kaon are parallel with the muon, will 
result in a residual background. As with electrons, we can reject muons 
from heavy flavour decays using isolation requirements. 

The efficiency for the isolation selection can be measured in collider 
experiments using the same tag-and-probe method as discussed for muons 
(see section 10.1). The probability for identifying a genuine muon inside 
the acceptance in ATLAS and CMS is in the range 95% to 99%, and the 
probability of misidentifying a charged pion as a muon is in the range 
0.1% to 0.2%, depending on how tight the muon selection is. 


> 
S LHO -az a 2 —<—S 
xs igea J 
i J 
oe eData  oMC 
Læ J 
0.5 -* Tight muons _| 
i ATLAS =- Medium muons | 
| Ys = 13 TeV, 139 fb" Loose muons 
ml< 2.5 
oO 
= ki e 
g a ne oa See See r oer oe 
Sali 
4 6 8 10 12 14 16 18 20 
P, [GeV] 


As for electrons, we can suppress the contribution from muons from 
heavy flavour decays by isolation requirements. 


MUON MOMENTUM RESOLUTION 


After the identification of a muon, it is the momentum of the muon that is 
usually of interest. As for any charged particle this is determined from the 
curvature of the tracks in a magnetic field. To understand the design con- 
siderations of muon detection systems, it is instructive to look at the de- 
sign choices and performance of the muon systems in CMS and ATLAS. 

In the CMS experiment, muons are detected in the central silicon 
tracker and in a dedicated muon system, which is located in four stations 
within the iron return yoke, which closes the field lines for the central 
tracking solenoid. The precision position information in the barrel sec- 
tion is provided by square gaseous drift tube (DT) detectors, while cathode 
strip chambers (CSCs) are used in the endcaps. Both systems are supple- 
mented by resistive plate chambers (RPCs), which provide trigger and 
precise timing information, but with lower spatial precision. 
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INTEGRAL PUNCHTHROUGH PROBABILITY 


5 10 


COUNTER NUMBER 

(~Ilem steel equiv, per counter) 
Figure 12.22: The integral prob- 
ability distributions for hadronic 
shower punch-through as a function 
of calorimeter depth for hadrons of 
different energies, measured with the 
CCFR experiment [369]. One counter 
corresponds to 11 cm of steel, about 
2/3 Ain- 
The corresponding probability for kaons 
is slightly larger, in the range 0.3% to 
0.5%. 


Figure 12.23: Measured muon recon- 
struction and identification efficiencies 
for different criteria in ATLAS [92]. 
The efficiencies have been measured 
in J/¥ — uu using the tag-and-probe 
technique. 


For muons from Z or W decays there 
will in general be no other particles 
with large pr at angles close to the 
muons. However, muons from heavy 
flavour decay such as b + cuv will tend 
to have other high pr particles at angles 
close to the muon from the decays of the 
charm quark. See [202] for an example 
of the measurement of the efficiency for 
isolation selection. 


The magnetic field strength inside the 
central solenoid is 3.8 T. 
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ATLAS uses a smaller superconducting solenoid with a magnetic field 
of 2 T for the inner tracking volume and a system of discrete coils to cre- 
ate a large air core toroid. There are 8 coils each in the barrel and the two 
endcaps, with a rotation of 22.5° between barrel and endcaps. The elec- 
tromagnetic and hadronic calorimeters are between the tracker solenoid 
and the muon system toroids. For the precision position measurement in 
the central part, pressurised drift tubes (monitored drift tubes, MDTs) are 
used, together with RPCs for triggering, while in the forward direction, 
CSCs for precision and thin gap chambers (TGCs) for trigger measure- 
ments are used. Due to the higher pressure and the drift geometry, the sin- 
gle wire resolution of the MDTs is better than for the DTs in CMS (about 
80 um compared to 200-300 um). 

The large air-core toroid for ATLAS was chosen at the time of the 
planning of the LHC experiments to achieve a good standalone perfor- 
mance of the muon system, to add robustness in case the inner tracking 
system would not perform as well as planned in the unexplored regime of 
high energy and high luminosity at the LHC. 
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The momentum resolution obtained for the ATLAS muon spectrom- 
eter demonstrates that the goal of good standalone momentum resolu- 
tion was achieved. In the barrel for very low momenta the energy resolu- 
tion is affected by energy loss fluctuations in the calorimeter, but above 
~10 GeV/c the resolution improves due to the low material in the air- 
core field, until the resolution degrades again at high momenta, due to the 
sagitta measurement error. 
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Figure 12.24: Layout of a DT cell with 
electric field lines of the drift field 
superimposed [203]. 


This concern turned out to be unnec- 
essary, the inner tracking detectors for 
both ATLAS and CMS turned out to 
have excellent performance. 


Figure 12.25: Relative momentum 
resolution as a function of the muon 
transverse momentum showing the 
stand-alone resolution of the muon 
systems and the inner trackers, and their 
combined resolution [417] (© Deutsche 
Physikalische Gesellschaft. Reproduced 
by permission of IOP Publishing. CC 
BY-NC-SA). (a) and (c) ATLAS for 

|n| < 1.5 and |n| > 1.5, respectively. 
(b) and (d) CMS for |n| < 0.2 and 

1.8 < |n| < 2.0, respectively. 
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In comparison, the standalone momentum resolution of CMS is worse, 


due to multiple scattering in the iron yoke. However, combined with the The effects cl sa Ea A on 
Z ‘ muon momentum resolution is ad- 
inner tracker the performance is better than for ATLAS, except for large dressed in exercise 3. 


momenta in the endcaps, because of the much larger value of BL? (see 
section 10.4). As the performance of the muon system in CMS is limited 
by multiple scattering, the demands for single wire resolution and cham- 
ber alignment of the precision detectors are much relaxed. 

The material also affects the trigger. In the case of an air-core magnet, 
the momentum resolution of the trigger chambers is usually sufficient to 
provide an efficient threshold up to relatively high pr. For an iron core, 
information from the inner tracking detectors is needed to provide a suffi- 
ciently sharp threshold. A disadvantage of the muon system being embed- 
ded in the return flux to the central solenoid field is that the field has the 
opposite direction to that in the tracker, so the tracks curve in the opposite 
direction. This will bend back the trajectories of low momentum muons 
from hadron decays, so that they can point to the vertex and occasionally 
fake a high momentum muon. 


12.9 TAU IDENTIFICATION 


The identification of tau leptons is much more challenging than that of 
electrons or muons as in general purpose collider experiments the tau will 
decay in the tracking volume. The 3-body leptonic decays T + /v;V, will Tre decay also involves a neutrino. 
in general result in electrons or muons with lower energy. This makes it 
harder to identify it in the presence of a very large hadronic background at 
low energy. In addition, there will be a background from genuine isolated 
electrons from W —> eve and W —> [LV,, decays. 
Therefore, tau identification focuses on the hadronic decay modes. The 
largest background is from jets of hadrons from QCD processes. Several 
criteria can be used to distinguish this from tau decays. 


e The lateral dimension of the jet of hadrons from a tau decay at high 
energy will be limited and in general less wide than that of the back- 
ground. 


e The hadronic tau decays will result in fewer charged hadrons than is 
typical in QCD jets. 


e Genuine tau decays will have a net charge of +1, therefore only 
candidates with a net charge of +1 of the reconstructed final state 
are used [204]. 


The CMS tau identification uses the hadronic decay modes listed in 
Table 12.1. 


Decay mode Resonance BF (%) Table 12.1: Decay modes and branching 


= = ratios for tau decays used in CMS tau 
A = h “r Wo identification. h* refers to hadrons (n* 
T Oh r v p(770) 25.9 or K*). Where relevant, the table gives 
T —+h- 77 Vr aı(1260) 9.5 intermediate resonances [204]. 
tT >hh-ht nv, — a, (1260) 9.8 


The CMS tau reconstruction combines the information from the re- 
constructed charged tracks associated with a jet with electron and photon 
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candidates within a window of An x Ad. The total vector momentum and 
energy in this region (called strips) is calculated. In the first version of 
the procedure a fixed window size of 0.05 x 0.2 is used. Charged hadrons 
from tau decays interacting in the detector material can lead to hadrons 
detected outside this window. Similarly, the effects of pair production, 
multiple scattering and the large magnetic field can result in electrons or 
photons detected outside this window. These energy deposits will decrease 
the efficiency of isolation requirements. Conversely, at higher values of 
pr Of the tau, the energy will tend to be contained within a smaller cone. 
Therefore, in the more sophisticated version of the analysis, the fixed win- 
dow algorithm is replaced with a dynamic window which depends on the 
pr of the tau [204]. 

The next step in the analysis is to reconstruct the invariant mass of 
the charged hadrons using the momentum measured in the strip tracker, 
m} (for the decay modes other than T~ — h~ Vz). The value of m? should 
be compatible within errors with the masses of the resonances listed in 
Table 12.1. 

The identification of genuine taus while rejecting backgrounds from 
QCD jets is optimised using a Multi-Variate Analysis (MVA). There are 
two types of variables used [205]: 


1. Lifetime. The finite lifetime of the tau (2.91 x 107! s) is used to de- 
fine several variables that provide some discrimination against QCD 
backgrounds. The simplest one uses the transverse impact param- 
eter dọ (see section 10.6) of the ‘leading’ (i.e. highest value of pr) 
hadron in the tau candidate final state and its estimated uncertainty 
Oxy. The ‘significance’ is defined by do / Od: 


2. Isolation. Genuine tau decays should have a small total momentum 
in the tracker and energy in the calorimeter in the region around the 
reconstructed tau. An isolation cone size of R = 0.5 is used. In a 
high-luminosity collider like LHC, there are many primary interac- 
tions within the same bunch crossing. These will occur over a range 
of longitudinal coordinates z. The pattern recognition will therefore 
reconstruct several primary vertices, with each track associated to 
a particular vertex. In order to minimise pile-up effects, only tracks 
compatible with the ‘hard scattering’ vertex are used. For the neu- 
tral particles, a statistical subtraction of the energy due to pile-up is 
made. 


The performance of the tau identification in CMS is illustrated in Fig- 
ure 12.26. 

Additional MVAs are used to separate genuine taus from backgrounds 
from muons and electrons. The efficiencies shown in Figure 12.26 are 
based on Monte Carlo simulations. Data-driven corrections are determined 
using the tag-and-probe methodology. The procedure uses the signal from 
Z — TtT with one tau decaying to a muon and the other tau decaying 
hadronically. The muon provides the ‘tag’ and the identification efficiency 
is measured for the hadronic tau decay mode. A ‘visible’ mass, myjs is de- 
fined as the invariant mass of the muon and the tau. Distributions of myjs 
are fitted to the expected peak from Z —> t*t~ and backgrounds. 
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Multi-Variate Analysis uses machine 
learning to allow for the correlations 
between variables to optimise the classi- 
fication of signal/background. 


The cone is defined in N, $ space as 
R= y (An} + (Ag). 


i.e. the vertex with high transverse 
momentum tracks associated to it. 
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12.10 MISSING Ey 


In a collider detector, the probability of a neutrino interacting in the detec- 
tor is practically zero. If a neutrino is created in the collision, this results 
in an imbalance of the measured momentum in the event, which can be 
used to infer the presence of the neutrino (or an exotic, weakly interacting, 
electrically neutral particle). 

In a collider experiment, a significant number of particles will travel 
inside or close to the beam pipe, where they avoid detection. Therefore it 
is not practical to measure the component of the total momentum in the 
beam direction. Also, typically the detector system which provides the 
most complete coverage of the solid angle, and can also supply measure- 
ments of neutral particles, are the calorimeters, which measure the energy 
of incoming particles. With sufficiently fine segmentation of the calorime- 
ter, we can assign a direction of the energy flow from the collision point. 
Hence, we use “missing transverse energy’ to determine the imbalance in 
the detected particles, which can be used to identify the presence of neu- 
trino(s) and get information on the kinematic properties. 

In a simplified picture, we assume that the energy E; is measured in 
calorimeter cell i and the line from the centre of the detector to the centre 
of the cell is at a polar angle 0; and azimuthal angle @;. We then define the 
missing transverse energy 2-D vector as 


ER = =} sin 0;cos ĝ;, and Eï = ~ Ei sin O; sin gj. (12.7) 


The magnitude of the missing transverse energy is then defined by 


miss __ 
Ey = 


In a real detector, there will always be a non-zero value of ft from 
fluctuations in the energy measurements. The resolution in the measure- 
ment of EF"’* is found to scale approximately as 


o(EE) o< [Ei sin Oi. 


In addition to this source of background from measurement errors, there 
will be a background to ‘prompt’ neutrinos from heavy flavour decays 
(e.g. semi-leptonic decays of b-hadrons). 


(12.8) 
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Figure 12.26: CMS tau identification 
efficiency for taus from a Higgs boson 
decay H + t*T~ versus efficiency 
for falsely identifying a QCD jet as a 
tau [204]. 


Missing transverse energy is a marker 
for many beyond the standard model 
physics searches. 


This method cannot identify if there is 
more than one neutrino in the event. 


If there are identified muon(s) in the 
event, we need to add their transverse 
momentum to that determined by the 
calorimeter. 


Sometimes the symbol Ér is used for the 
missing transverse energy. 
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In the very sophisticated LHC detectors, the resolution can be im- 
proved by considering the momentum flow of different objects. The miss- 
ing transverse energy in the x-direction (with an equivalent definition in y) 


is [93], 
E=- } pic—) pic, 
ichard i€soft 
where the ‘hard’ terms refer to the reconstructed electrons, photons, 
muons, taus and hadronic jets. The ‘soft’ terms refer to the tracks from 
the primary vertex that were not assigned to any of the hard objects. 


12.11 NEUTRINO FLAVOUR IDENTIFICATION 


In neutrino experiments it is often important to be able to identify the 
flavour of a neutrino responsible for an interaction. It is not possible to 
identify the neutrino flavour in a neutral current process but in a charged 
current process lepton flavour conservation ensures that the flavour of 
the neutrino is transferred to the charged lepton. Therefore the problem 
of neutrino flavour identification is transferred to that of identifying the 
flavour of the outgoing charged lepton (e, u or T). 

The techniques used for identification of the charged leptons will vary 
depending on the detector technology used for the neutrino detector. For 
example, the Super-Kamiokande (SK) experiment uses a very large vol- 
ume of water (approximately 50,000 tonnes) viewed by 13,000 photo- 
multiplier tubes (PMTs). The u and e from the neutrino interaction will in 
general be above threshold for the emission of Cherenkov radiation, which 
is then detected by the PMTs. Therefore both relativistic electrons and 
muons will result in a ring of ‘hits’ in the PMTs at an angle to the direc- 
tion of the electron or muon. However, electrons will start electromagnetic 
showers in the water and this will then result in emission at a range of an- 
gles. Therefore the electron rings will tend to be fuzzier than the sharp 
muon rings. This information is used quantitatively by constructing a like- 
lihood [294] 


unhit 


L(V, 0) = | | Pj(unhit|;) x 
j 
hit 


x I {[1—P,(unhit|,;)] fo (gil mi) F(T, @)}. (12.9) 


T is the event hypothesis (e.g. electron- or muon-like) and 0 represents the 
kinematic properties. u; = L;(T, 0) is the expected number of photoelec- 
trons produced by the ith PMT given the hypothesis. P is the probability 
that it does or does not register a hit given the fitting hypothesis. The like- 
lihood considers all the PMTs (the ones that are hit and and the ones that 
are not hit). For the hit PMTs, the measured charge and time are used. 
fa(qi|Mi) is the probability of observing a charge q; for a predicted charge 
li and similarly f (t;|I, 0) is the probability for a hit to occur at time t; 
given the event hypothesis I’ and the kinematic properties. 

In iron/scintillator calorimeters such as MINOS the neutrino flavour 
identification uses the shape of the energy deposition in the calorime- 
ters. The electrons from ve will result in short electromagnetic showers, 
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For example, for electrons the dedi- 
cated calibration and reconstruction 
procedures will give more precise mea- 
surements of the electron pr than simply 
using the uncorrected energies in the 
calorimeter (see section 11.7). Similar 
procedures improve the resolution for 
the other hard objects. 


Care has to be taken to avoid using 
the same calorimeter cells in multiple 
objects. There is a signal ambiguity 
resolution algorithm that assigns cells 
to not more than one object [93]. 


This is obviously essential for any ex- 
periment studying neutrino oscillations. 


See Figure 2.38. 


An example of a neutrino charged cur- 
rent interaction would be v.X > e-Y, 
where X and Y represent some hadronic 
states. 


For a more detailed discussion of SK 


see section 14.2. 


See section 12.4. 


See Figure 12.9. 


For a more detailed discussion of 
MINOS see section 14.2. 
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whereas V, will result in muons that will propagate long distances in the 
calorimeter without showering. The momentum of the muon can be deter- 
mined from its curvature in a magnetic field. 
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Figure 12.27: Electron/muon separation 
in the SK experiment [294]. The figures 
show the likelihood distributions for 
electrons and muons. The left (right) 
plot is for sub-GeV (multi-GeV) atmo- 
spheric neutrinos. The points with error 
bars are data and the histograms are 
from Monte Carlo simulations of Ve and 
Vy interactions. 


If the energy is sufficiently large the 
muons will exit the calorimeter but 
lower energy muons will ‘range out’, 
i.e. stop in the calorimeter. For these 
muons, the energy can be estimated 
from the measured track range. 
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Figure 12.28: Neutrino interaction topologies observed in the MINOS detectors [236]. Left: A CC Vy interaction (muon track). Middle: A NC in- 
teraction (nuclear recoil). Right: A CC v, interaction (electron shower). Each coloured rectangle represents an excited scintillator strip, the colour 
indicating the amount of light: purple and blue are low light levels, through to orange and red for the highest light levels. 


The identification of v; is much more difficult due to the added chal- 
lenge of identifying and reconstructing the tau from the charged current 
interaction. This was achieved in the DONUT experiment [314]. The ob- 
servation was based on the use of photographic emulsion (see section 6.1), 
in which the decays of the T could be detected as either kinks in the track 
(‘1-prong’) or vertices with one charged track turning into an odd number 
of charged tracks. Owing to the very small neutrino cross-sections, a very 
large volume of emulsion was required. It would not have been feasible to 
scan the full volume of emulsion for candidate vertices. Therefore external 
detectors were used to record charged tracks emerging from the emulsion 
and thus identify locations of interest where the emulsion was scanned and 
the tracks from the vertex measured. The emulsion targets consisted of a 
stack made of layers of 1 mm thick stainless steel sheets interleaved with 
emulsion plates made of 100 um thick emulsion layers on each side of 
an 800 um thick plastic base. The length of exposure for each target was 
set by the track density from muons, with a limit of 10° cm~? that was 
reached within about one month. 


F.L. = 540 um 

Oink = 13 mrad 

p > 210$ GeV/c 
Pr > 0.28 *533 GeV/c 


f 7 bY 


Figure 12.29: v; CC interaction event 
in the DONUT experiment [314]. The 
neutrinos are incident from the left. 
The scale is given by the perpendicular 
lines with the vertical line representing 
0.1 mm and the horizontal 1.0 mm. The 
target material is shown by the bar at 
the bottom of each part of the figure 
representing steel (grey), emulsion 
(hatched) and plastic (no shading). 
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12.12 JET TAGGING 


The most important type of jet-tagging is identifying heavy flavours, 
i.e. hadrons that contain b or c quarks. We will focus here on b-tagging 
because the longer lifetime of b-hadrons makes them easier to identify. 
There are two possible signatures we can use to distinguish b-jets from 
light quark or gluon jets: 


e The semi-leptonic decay mode of the b-hadrons can result in muons 
or electrons with high transverse momentum with respect to the 
direction of the b-hadron. 


e The relatively long lifetime of the b-hadrons (@(1 ps)) allows to 
reconstruct their decay vertices, or to detect tracks that do not point 
back to the primary vertex. 


The first technique is limited in efficiency by the magnitude of the 
branching ratios of the semi-leptonic decay modes of b-hadrons. In prac- 
tice it is much easier to identify a muon inside a jet rather than an electron, 
because it can be detected in isolation in the muon system. On the other 
hand, it is difficult to distinguish in the calorimeter the electron shower 
from the other particles in the jet. It is therefore usually only the decays 
with muons that are used. The key signature is that of a muon with a di- 
rection close to the jet axis but with a significant component of momentum 
perpendicular to the jet axis. 

The second technique is based on the significant flight path of a 
b-hadron before it decays. Therefore, high energy b-hadrons can have de- 
cay distances of several mm. In most cases for collider detectors, the de- 
cays will occur inside the beam pipe and we have to reconstruct the decay 
from the charged tracks measured by the vertex detector surrounding the 
beam pipe. If a secondary vertex can be reconstructed that is significantly 
displaced from the primary vertex, this provides very powerful separation 
between B hadrons and light-quark hadrons. However, even if a secondary 
vertex cannot be reconstructed, powerful b-tagging can still be performed, 
as there will be tracks resulting from the b-hadron decay that do not point 
back to the primary vertex. A signed impact parameter is defined in the 
r-@ view such that genuine decays will have a positive value but the result 
of random measurement errors will result in a positive or negative value. 

A similar definition can be made in the longitudinal plane. A per-track 
significance is defined as Sgo = do/ Odo» Where do is the impact parameter 
and Oy, is its uncertainty. At large positive values of Syo there is a clear 
signal for long-lived hadrons from b-quarks. 

One relatively simple approach to b-tagging is to define a log- 
likelihood [94] 


(12.10) 


where the sum runs over all the tracks associated to a particular jet and pe 
(pi) is the probability for a track from a b-jet (light-quark jet) to result in 
the given value of Sj9. The estimated values of pt and pi‘ depend on the 
measured value of do but also the particle kinematics as this determines 
the uncertainty Ogy. 
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BR(B* —> Iv;X) = (10.99 £0.28)% 
and BR(B® — Iv,X) = (10.33 +0.28)%, 
where these branching ratios are aver- 
aged over electron and muon decay 
modes [505]. 


For B? mesons, ct = 456 um [505] (t is 
the lifetime of the meson). 
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Figure 12.30: Definition of signed trans- 
verse impact parameter dọ (after [370]). 
The sign of the impact parameter is 
based on the angle between the jet and 
the line between the primary vertex 

and the point of closest approach of 

the track. Top: positive sign, bottom: 
negative sign. 
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Figure 12.31: Simulated distributions 
4 of Sao for tracks from b-jets, c-jets and 
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An alternative approach combines the measurements of the impact pa- 
rameter in the transverse and longitudinal planes. A third method uses 
reconstructed secondary vertices. The simple log-likelihood approach 


ignores the strong correlations between tracks from the same jet. These In principle, the likelihood could be 

K é P , < é extended to a very large number of 
correlations can be ‘learnt’ by machine learning algorithms, so they can variables to accommodate this but this 
produce superior performance compared to the simpler log-likelihood ap- "2! computationally practical. 
proaches. 


The ‘low-level’ taggers are broadly based on two different ap- 
proaches [94]. 


e Impact parameter significance. These use the large impact param- 
eters (either in 2D or 3D) of charged particles from the decays of 
b-hadrons. In order to maximise the separation power between 
signal and background these algorithms use the estimated ‘signifi- 
cance’ for an impact parameter dp as do/ Ody: 


e Use of secondary vertices. In one version this approach, the JETFIT- 
TER algorithm [95] tries to reconstruct the full b-hadron to c-hadron 
decay chain. 


The performance can be improved by combining the results of the low- 
level taggers in so called high-level taggers. As the information between 
the low-level taggers is correlated, this is done using machine learning 
methods like Boosted Decision Trees or Neural Nets. The performance 
of b-tagging techniques are illustrated in the plot of b-tagging efficiency 
versus rejection of light quark jets (the inverse of the efficiency). The best 
b-jet detection efficiency versus rejection of light quark jets is achieved 
with the high-level taggers. 
A rejection factor against light-quark jets of ~1000 can be achieved at in addition to fake b-tags, physics 


z ; b analysis will also need to account for 
ab tagging efficiency of 70%. : : genuine B hadrons from gluon splitting 
The efficiencies calculated from Monte Carlo simulations need to be g— bb. 


corrected for inaccuracies in the simulation. This is done by measuring 
the b-tagging efficiency in a sample of tt (with t + bW and W — ev, or 
W — [LV,,) events and then defining scale factors as the ratio of efficiency 
determined from data to that from the Monte Carlo simulation. 
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We will now consider the LHCb Vertex Locator (VELO). The func- 
tion of the VELO is slightly different to that of the central detectors 
(e.g. ATLAS or CMS). The VELO is used to not just to tag the presence 
of b-hadrons but to fully reconstruct the vertices. This allows for the full 
reconstruction of b and c hadrons from secondary and tertiary vertices. In 
addition, the study of CP violation and oscillations requires excellent time 


2x10 


resolution for the secondary and tertiary vertices. 


The LHCb experiment is a single-arm spectrometer at the LHC. It is 
optimised for reconstruction of b-hadrons in the forward direction. The 
VELO uses similar technology, silicon microstrips, to that of ATLAS and 


CMS but the layout is very different [3]. 
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Figure 12.32: Light-quark flavour 
rejection ratio versus b-jet tagging 
efficiency in ATLAS for three low-level 
taggers (IP3D, SV1 and JetFitter) 

and two high-level taggers (MV2 and 
DLI) [94]. 


Figure 12.33: The scale factor between 
the b-tagging efficiency in ATLAS 
measured for data compared to Monte 
Carlo simulation for the MV2 algorithm 
as a function of jet transverse momen- 
tum [94]. 


In order to survive the very high flu- 
ences expected for the VELO silicon, 
the detectors uses n* -in-n and n* -in-p 
sensors, rather than pt -in-n. 
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Figure 12.34: Layout of the silicon detectors in the LHCb VELO [3]. The VELO divides in x into two halves for retraction during LHC beam injec- 


tion. 


In operation, the silicon is moved within 7 mm of the beam line. How- 
ever, this location would receive too high a fluence during beam injection 
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before acceleration, when beam losses are much larger. Therefore it is re- 
tracted from the beam line before there are stable beams. 

In order to optimise the space point resolution, analogue readout is 
used. Using analogue readout the baseline variations (pedestals) are sub- 
tracted from each channel in data taking. Analogue readout allows for 
the suppression of ‘common mode’ noise, which can arise from coher- 
ent noise across a sensor. The mean baseline level of all the ADC values 
corresponding to a sensor (excluding genuine hits) is calculated and this 
level is then used to correct all the ADC values for the sensor. 

The main advantage of analogue readout is the improved space point 
resolution for a given strip width, compared to a binary readout. Neigh- 
bouring strip hits (defined as ADC values) above a given threshold after 
baseline subtraction are clustered together. The charge (given by the ADC 
value) centroid is used to determine the hit location on the sensor. 

The position resolution can be determined by measuring residuals of 
the hit location compared to a fitted track. The resolution increases with 
strip pitch. It is also very sensitive to the projected angle between the track 
and the sensor, as this determines the amount of charge sharing. 
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The measured cluster centroids are used to fit tracks. The tracks are 
then extrapolated back to the primary vertex to determine the impact pa- 
rameter. There are two contributions to the resolution of the impact param- 
eter do, multiple scattering and point measurement uncertainties. The mea- 
sured impact parameter resolution can be parameterised (see exercise 2) as 
a function of transverse momentum, pr as 

A 


Og = — +B. 
PT 


(12.11) 


The parameter A depends on the detector thickness before the first layer in 
units of radiation length, and B depends on the detector resolution. 

The VELO strip detector has been replaced for LHC Run 2 with a hy- 
brid pixel detector. This provides superior resolution, higher radiation tol- 
erance and faster readout. This allows the detector to be moved closer to 
the beam pipe and therefore increases the acceptance [514]. 

In addition to using impact parameter measurements to identify de- 
cays of b and c hadrons, a vital requirement for LHCb is a full reconstruc- 
tion of the decay chains and a precise measurement of the time of flight 
from the primary vertex to the decay vertex. The decay time in the rest 
frame of the particle is given by t = mL/p, where m is the mass of the 
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The analogue signals from the front 

end ASIC are transferred to a repeater 
board outside the VELO detector and 
then sent via 60 m of twisted pair cables 
to the counting room where they are 
digitised. 


This threshold is set at six times the 
measured noise value. 


Figure 12.35: Cluster resolution in the 
LHCb VELO [3]. Left: As a function 
of strip pitch for different projection 
angles. Right: Resolution divided by 
pitch as function of the track-projected 
angle for four different strip pitches. 
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decaying particle, L the distance from the primary to the decay vertex and 
p is the momentum in the LHCb frame. The resolution of the time mea- 
surement depends on the particular decay topology. Results obtained for 
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Figure 12.36: Measured transverse 
impact parameter resolution in VELO 
in the x direction as a function of track 
inverse transverse momentum, 1/pr [3]. 
A simple linear fit was used rather 
than the addition of the two terms in 
quadrature (as would be expected for 
uncorrelated errors) because the errors 
are dominated by multiple scattering 
over most of the range of 1/pr used in 
the fit. 


Figure 12.37: Measured decay time 
resolution in VELO for the topology 
B? > J/¥o —> utu“ K+K- [3]. The 
contribution from genuine decays is 
removed before making this plot to 
avoid a spurious tail at positive times. 
The resolution quoted is based on a fit 
to the negative side of the distribution. 
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Key lessons from this chapter 


Identification of particles provides an important kinematic constraint, as it identifies 
the mass of the particle unambiguously, and it identifies the flavour of the particle. 


Dedicated particle identification detectors provide a measurement of the velocity of 
the particle, in the form of either the relativistic B or y factors. Combined with a mo- 
mentum measurement this allows for reconstruction of the mass. 


Dedicated particle identification technologies comprise, in order of increasing By 
of the sensitive momentum region, time-of-flight, dE /dx, Cherenkov radiation and 
transition radiation. 


The separation power of TOF is limited by the distance between the timing detectors, 
and their time resolution. 


Separation by dE /dx is easy for low energies, but becomes hard for minimum- 
ionising momenta and above. Due to the density effect, above minimum-ionising 
only gaseous detectors provide enough differentiation in dE /dx. The main challenge 
is the straggling of the energy loss, and one way to reduce its effects is the use of the 
truncated mean of a number of individual energy loss measurements. Alternatively, 
cluster-counting is less affected by straggling. 


The most powerful Cherenkov detectors measure the opening angle of the Cherenkov 
cone for a measurement of p. The challenge is that continuous tracking of the pho- 
tons is not possible. In practise, the endpoint of the photon trajectory is measured, 
and the start point is inferred from other means (either proximity or mirror focusing). 
To maximise the range of sensitivity RICH system typically comprise more than one 
radiator, with different indices of refraction. 


The main challenge for the detection of transition radiation is the small probability 
for the emission of a photon on one interface, so that a large number of interfaces is 
required. To maximise photon collection in the detectors low-Z radiators and high-Z 
detectors are used. TR is sensitive to the relativistic y factor, and thus it is primarily 
useful in experiments for electron identification. 


Other particles (electrons, muons, jets, tau and heavy flavour hadrons) can be identi- 
fied from the characteristic signatures and combination of the tracking and calorime- 
ter systems in a standard particle physics experiment. For muons this relies on fil- 
tering of hadrons in the calorimeters, for electrons on the typical dimensions of 
electromagnetic showers. In all cases matching of tracks and showers in different 
subsystems and isolation cuts are powerful means to suppress backgrounds. 


The flavour of neutrinos in neutrino experiments can be inferred from the flavour of 
the lepton produced in charged current reactions. 


EXERCISES 


1. TOF separation power. 
Starting from m = p/(BY) = p\/1/B2 —1 and B = L/(tc), derive eq. (12.2). Derive eq. (12.3). 
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2. Errors on vertex reconstruction. 


Consider a cylindrical vertex detector with detector layers at a radius rı and r2. The resolution of 
the detectors are 0; and 02. 


a) Ignoring the effects of the magnetic field and multiple scattering, what is the uncertainty on 
the measured impact parameter do? 


b) For the same geometry, ignore the effects of the finite measurement errors and consider the 
effects of multiple scatting. The material in the beam pipe and the first detector layer is x in 
units of radiation length. What is the uncertainty on the measured impact parameter dp as a 
function of transverse momentum pr? 


3. Measurement in a muon spectrometer. 
Consider a magnetised iron muon spectrometer. The magnetic field is B and the track length in the 


spectrometer is /. The momentum of a unit charged particle transverse to the magnetic field is pr. 


a) What is the change in pr from the start to the end of the trajectory in the magnetic field (ig- 
noring the effects of multiple scattering)? You can assume that the deflection of the particle is 
small and that it has a speed of B = 1. 


b) What is the uncertainty in the angular deflection of the particle in traversing the iron, due to 
multiple scatting? What is the corresponding uncertainty in pr? 


c) What is the resulting fractional resolution in pr arising from multiple scattering alone? Eval- 
uate this for a muon spectrometer with / = 2 m and B = 2 T. Comment on the performance of 
such a spectrometer, compared to the resolution achieved by ATLAS and CMS. 

4. B® and vertex layers. 
Consider the decays of a B° hadron in a collider detector with a cylindrical vertex detector. The 
first detector layer is at a radius r = 3 cm. The lifetime of the B? is t = 1 ps. What fraction of the 
B? hadrons decay before the first layer for transverse momentum values of 5, 10 and 50 GeV/c? 
5. Momentum resolution in a solenoid. 


Consider a tracking detector inside a solenoid with an axial magnetic field B = 4 T and a radius 
R=2m. The resolution of the space points at the start, middle and end of the tracks is 100 um. 


Neglecting multiple scattering calculate the resolution for the measurement of pr as a function of 
pr for tracks that are perpendicular to the axial magnetic field. Compare this resolution with that 
of an iron core spectrometer (question 3). 
6. e/7 separation using a TRT. 

Consider a TRT with 30 straw chambers for the readout. The efficiency for getting a high threshold 
hit in one straw is 20% for electrons and 5% for pions. Electrons are selected if there are Nmin high 
threshold hits out of 30. For the purposes of this question assume that the hits are uncorrelated. 

a) What value should be used for Nin if we want an electron efficiency > 90%? 

b) What would the resulting pion efficiency be for this value of Nin? 


c) Compare your answer with that given for ATLAS in Figure 12.16 and discuss the origin of 
any discrepancies. 


3 Triggers 


INTRODUCTION 


In general, in particle physics interesting events and the response they 
produce in detectors are instantaneous, or they have a very short dura- 
tion given by decay lifetimes. This duration can be increased by the finite 
response time of the detectors, but is still very short compared to the op- 
erational time of the experiment. Other events, from interesting physics 

or backgrounds, will take place before and after, and thus, in general, the 
experiment will need to identify the occurrence of specific events in time 
and record the information from its detectors for this instance. This identi- 
fication is provided by the trigger. 

In experiments searching for rare physics, the rate of triggered events 
will be low, but signal events still need identification among backgrounds. 
In many other cases, the combined rate of signal and background events, 
and the amount of data recorded for each event, makes readout of all 
events occurring in the experiment prohibitive, and the task of the trigger 
in these experiments is to reduce the rate of events selectively to a man- 
ageable level, while significantly enhancing the fraction of potential signal 
events in the process. 

For any measurements of (relative) rates, for example for measure- 
ments of cross-sections, the (relative) trigger efficiencies must be known, 
and the error on this knowledge is often a significant contribution to the 
error of the actual measurement. Events that are rejected by a trigger are 
lost, so great care has to be used in designing triggers and understanding 
any additional bias they introduce. 

There are many different types of triggers that are used by different ex- 
periments. This chapter will not aim to be comprehensive, rather we start 
with discussing some basic principles, then look at the trigger challenges 
at more complex experiments, and finally study some examples for trig- 
gers in different particle physics experiments. 


13.1 


13.2 BASIC TRIGGER CONCEPTS 
COINCIDENCES 


Coincidences are a way to identify the hits belonging to a specific event 
topology. In coincidences the correlation of hits is done in the time do- 
main, whereas a spatial correlation can be achieved by the position of the 
included detectors. Coincidences are usually used for fairly large detection 
elements (for example scintillator paddles). 

Coincidences are particularly useful when the detectors included are 
producing signals with a significant fake rate, for example due to noise or 
thermally induced signals in detectors with large internal gain (for exam- 
ple photomultipliers). 

In coincidence circuits two or more binary signals that are expected for 
the observation of an event are required to occur in time with each other 
for a positive decision that the event has occurred. If we have two binary 


DOI: 10.1201/9781003287674-13 


An exception are emulsion neutrino 
detectors (see section 6.1), which can 
operate with very long exposure times 
due to the low local event density. How- 
ever, even these detectors are paired 
with electronic (and triggered) detec- 
tors, that indicate where the interesting 
physics event took place. 


Coincidences can also be used to obtain 
clean start/stop signals for time mea- 
surements (for example of drifttimes or 
TOF). 
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input signals, with duration 7; and T, and signal rates (including fakes) R1 
and R2, respectively, then the rate of fake coincidences is 
RÈ = RiR (Ti +T). (13.1) 

Usually 7; + T> < (R12)! and thus the fake rate of the coincidence will 
be much smaller than the fake rates of the inputs. 

Both input signals will have their inherent inefficiency from the sensor 
itself and the comparator decision in the conversion to a binary signal. 
If the efficiencies for the two signals are €; and &2, respectively, then the 
combined efficiency is 


Ec = E182. (13.2) 


If the input fake rates are high, then the duration of the signals must be 
small, and the coincidence will require careful adjustment of the time of 
arrival of the two input signals. Including further inputs will improve the 
fake rejection. 

Coincidences can also include inverted inputs, if the absence of a sig- 
nal in that detector is part of the desired event topology. For example, if 
we wish to record the response of a detector to K* in a beam consisting 
of K* and z* we could add an upstream threshold Cherenkov detector. If 
the radiator is chosen appropriately for the momentum of the beam parti- 
cles, the K+ would be below threshold for Cherenkov radiation but the 2* 
would be above threshold. We could then use the logical signal from the 
Cherenkov counter as a veto. 


TRIGGER SIGNATURES AND TURN-ON 


More generally, a binary trigger decision is based on an analogue mea- 
surement that is compared to a trigger threshold value. The analogue input 
can be simply the signal height in a single or multiple detector cells, or it 
can be kinematic information gathered from a detector system (like the 
momentum measured in a magnetic spectrometer, or the missing Er in 

a collider experiment). Both efficiency and fake rate will depend on the 
threshold, with the rejection of backgrounds improving, but signal effi- 
ciency decreasing, for increasing threshold. 

To minimise the processing power required for the trigger, and to 
achieve a fast decision time, the reconstruction of kinematic event prop- 
erties is usually done at a relatively coarse level, with a limited resolution. 
As a result, the trigger will not switch its response exactly at the thresh- 
old value, but the transition from rejection to full acceptance will occur 
gradually as the real value of the kinematic trigger variable increases. This 
behaviour can be illustrated using trigger ‘turn-on curves’, which show the 
efficiency as a function of the true value of the kinematic trigger variable 
(which, for example, can be obtained in the more accurate offline analy- 
sis). A sharp turn-on facilitates the understanding of the trigger efficiency. 
In addition, the background processes usually have a steeply falling spec- 
trum as a function of the variable used for the trigger decision (e.g. pr). 
Improving the resolution allows to reduce the threshold and thus can sig- 
nificantly reduce the fraction of background events accepted. 
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The logical operation can be performed 
by an AND gate. For simple appli- 
cations, for example in test beams, 
standardised modular electronics are 

in wide use, for example based on the 
Nuclear Instrumentation Module (NIM) 
standard [213, 329]. For more complex 
topologies field-programmable gate 
arrays (FPGAs) are available. 


This assumes that the two signals arrive 
at the same time, which can be achieved 
with appropriate delays. 


This is called an ‘anti-coincidence’. 


Coincidences have played an impor- 
tant role in early experimental particle 
physics using detectors with electronic 
readout, leading to the Nobel prize 

for Walther Bothe ‘for the coincidence 
method and his discoveries made there- 
with’. See for example [156]. 


This is referred to as the trigger ‘signa- 
ture’. 


Another reason for degraded resolution 
at the trigger level can be the lack of 
availability of accurate calibration or 
alignment data during the data taking. 


ideal trigger 
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Figure 13.1: Schematic trigger turn-on 
curve. 
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13.3 COMPLEX TRIGGERS 
TRIGGER TYPES 


It is obvious that in general a trigger should have a high efficiency for 
recognising signal events, and at the same time reject a large fraction of 
background events. However, often more complex trigger capabilities are 
required. Different physics studies can require datasets obtained with more 
than one type of trigger, and some of the experimental bandwidth must 
usually be reserved for secondary datasets needed for instrumental and 
physics background studies, detector, tagging and trigger efficiency mea- 
surements, calibrations (for example for energy scales), etc. In a complex 
detector like at LHC there will be several trigger streams targeting differ- 
ent physics analyses. The trigger algorithms and thresholds at the different 
trigger levels will need to be tuned so as to maximise the physics potential 
given the constraint of the available bandwidth. One simple way to record 
data with unbiased triggers is pre-scaling, where a random fraction of the 
events is accepted, but at a rate which is reduced, so that it does not signif- 
icantly obstruct the collection of signal data. 

Another important way that pre-scaling is used is for triggers on high 
pr jets. The threshold for an un-prescaled jet trigger will need to be very 
large in order to keep the accepted event rate at a manageable level. How- 
ever, several lower thresholds can be used if the prescaling factors are de- 
creased. Therefore data is recorded over a larger range of pr and is avail- 
able for analysis. For example, the cross-section can be measured over a 
much wider range of pr than that which would have been possible using 
only an un-prescaled trigger. 


TIMING 


The finite time that it takes to come to a trigger decision by hardware or 
software becomes relevant when the event rate in an experiment is high. 
We call the time used to form the trigger decision and distribute the accep- 
tance signal to the individual readout channels the ‘latency’. 

After an event it usually takes some finite time until the detector is 
fully ready to record the next. This time is called the ‘deadtime’. It can 
comprise detector-related time constants, for example to complete the drift 
of charges, or the delayed relaxation of excited states, and readout-related 
contributions from digitisation, and signal and trigger processing. In high- 
rate experiments the former is usually accepted as ‘pile-up’, as long as the 
data from individual events can be separated in the offline analysis. 

For a trigger rate R, and a deadtime Tų the fraction of recorded events 
is given by 

Ree L oe (13.3) 

Ri 1+R:Ta 

This is maximised (Rec ~ R;), if the deadtime and the trigger rate can be 
kept small (R,T < 1). In principle we can create a deadtimeless trigger 
at level 1 (L1) by using local pipeline memories that store the front-end 
data locally for the duration of the trigger latency. The data used for the 
hardware trigger is read out into pipelined processors. These processes 
will make a trigger decision after a fixed time. The trigger decision is pro- 
vided to the front-end electronics after a fixed latency. Accepted events are 
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Trigger efficiency is defined as the 
fraction of accepted events divided by 
the real events. In practice the latter 
corresponds to the events accepted in a 
physics offline analysis. 


Robustness can be improved by using 
redundant triggers, i.e. triggers target- 
ing the same type of physics events, but 
using different inputs. 


Since trigger rate vary with the instan- 
taneous luminosity, dynamic pre-scaling 
is sometimes used (pre-scaling frac- 
tions are decreased as the luminosity 
decreases during a run). 


These pre-scaled triggers can be very 
useful for detector studies, e.g. to mea- 
sure the trigger efficiency for a higher 
threshold trigger. 


In accelerator experiments there are 
also more extended periods of opera- 
tional deadtime, for example the time 
needed to fill the accelerator, where no 
collisions take place. 


This allows for data to be stored in the 
pipeline memories with no deadtime 
but there will still be deadtime in the 
readout of the data from the pipelines. 
For LHC experiments, the pipelines are 
implemented in the front-end ASICs that 
read out the detector data. The finite 
bandwidth of the readout of the data 
from the buffers will result in deadtime 
if the LI trigger rate is too high. In the 
case of ATLAS, preventative deadtime 
is introduced so that the buffers do not 
become full [96]. 
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transferred to buffers for subsequent readout. The data corresponding to 
non-triggered events will be overwritten with new data. 


MULTI-LEVEL TRIGGERS 


In complex high-rate, high-background experiments like at the LHC 
the data rates cannot be reduced to an acceptable level without the loss of 
too much efficiency for the signal events. The strategy is therefore to split 
the trigger decision into several levels (typically 2 to 4), with progressively 
lower trigger rates, allowing for progressively longer processing times. 
These higher level triggers are implemented in software, allowing more 
powerful algorithms to be used. Part of each trigger level is a local buffer 
that stores the data locally during the time of the trigger decision at this 
level. Trigger rates, latency and buffer size must be carefully matched, 
so that the readout bandwidth is fully used, while avoiding buffer over- 
flows. The maximum accept rate for each trigger level is set by the rate 
with which data can be transferred to, and processed at the next level. 

Because the first trigger level encompasses fast and simple decisions, 
and requires local buffering, first level triggers are usually implemented in 
hardware, whereas higher level triggers typically run on farms of standard 
commodity computers. 


TRIGGERS AT COLLIDER EXPERIMENTS 


In experiments at accelerators, the time structure of the bunches in the 
beam is usually used as a first indication when to activate the readout of 
the experiment. 

At collider experiments the primary targets for triggers are usually 
high-py muons, electrons or photons, high energy jets of hadrons or miss- 
ing transverse energy. Combinations of shower shapes in the calorimeters 
and tracking information are used to select electrons and photons. Particles 
penetrating the calorimeter and giving hits in the muon chambers are used 
for muon triggers. In addition, isolation criteria can be used for leptons 
to reject backgrounds from QCD jets. Multi-object triggers (for example 
combinations of leptons) can reduce trigger rates. 

Pipelined level 1 triggers operate with a fixed latency, which excludes, 
for example, the use of iterative algorithms. Level | trigger operations are 
thus usually either simple arithmetic operations or functions using mem- 
ory lookup tables. However, the complexity of algorithms that can be per- 
formed at level 1 has increased over time and for example the ATLAS L1 
trigger can perform mass cuts. To cope with the higher luminosity in LHC 
run 3 a new custom electronics processing system was built that enables 
the calorimeter data to be available with a tenfold increase of granularity, 
which allows the use of more sophisticated identification algorithms [97]. 

Higher trigger levels can reconstruct tracks and perform matching of 
information in different sub-detectors. They can also reconstruct event 
kinematics for effective mass and topology cuts. The complexity of higher 
level triggers has greatly increased, due to the wide availability of high- 
performance processors (graphics processing units, GPUs) and parallel 
operation. Level 2 trigger processing is generally still limited in execution 
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For the LHC experiments the latency is 
typically 3 us. 
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Figure 13.2: Typical multi-level readout 
and trigger structure. Due to increased 
processing power modern multi-level 

triggers are typically two-level triggers. 
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farms 


Field-programmable gate arrays 
(FPGAs) are the most powerful devices 
for use in level | trigger processing, 

as they allow for easy modification of 
trigger algorithms. 


Very different trigger signatures are 
needed for detectors targeting heavy 
quark (b or c) production or in heavy 
ion experiments. 


The numbering of the trigger levels 
is not consistent, with some experi- 
ments labelling the first trigger level as 
‘level 1’, while other call this ‘level 0’. 


For example matching of tracks in the 
tracker and the muon system, or of 
tracks and clusters in the calorimeter. 
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time. One way to achieve this is by dividing processing into ‘Regions of 


Interest’ (RoIs) determined by the Level 1 trigger data. The Rol approach will be discussed 
: i : A 2 2 later in this chapter in the context of 
The identification of the correct bunch-crossing by the L1 trigger is ATLAS and CMS triggers. 


essential in order to initiate the readout of the data from the pipelines cor- 
responding to the correct bunch crossing(s). Bunch-crossing identification 
can be challenging in large high-rate experiments as in the LHC, where 
the bunch-crossing frequency is 25 ns, much shorter than the signal ar- 
rival times and duration in many detectors, and even the time-of-flight of 
particles through the detector. 


13.4 TRIGGERS AT LHC 


One of the most challenging environments for triggering are the experi- 

ments at the LHC. At \/s = 14 TeV the total cross-section for pp interac- 

tions is o ~ 100 mb or about 100 million events per second at an instan- 

taneous luminosity of -Z = 10°? cm~?s~!. The difficulty is illustrated by 4 the LHC Run oe ae 
considering the case of Higgs boson production. The total Higgs boson ern ete 
cross-section o (H) = 54 pb and thus the event rate for events containing 

Higgs is 107° times smaller than the total collision rate. 


proton - (anti)proton cross sections Figure 13.3: Standard model cross- 
10 T T T 10° sections and event rates at hadron 
colliders [178]. 
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Furthermore, the branching ratios for the more easily identifiable de- 
cay modes of the Higgs are very small, e.g. BR(H — yy) = 2.27 x 107+. 
Therefore, the LHC has to operate at very high luminosity to achieve a 
useful rate for these type of rare events, which results in an extremely 
high rate of pp collisions. It is unfeasible to store the data from all these 
collisions and therefore very powerful trigger systems are required to re- 
duce the rate sufficiently to allow the data to be permanently stored. At the 
same time the triggers need to have a high efficiency for detecting these 
rare events. 

An additional complication for triggers at LHC is that there are col- 
lisions every 25 ns when bunches of protons collide. The length of time 
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required to form a trigger decision is much longer than this. To avoid very 
large deadtime, pipelined triggers are used. 

In general the triggers need to reduce the readout rate to a level that is 
acceptable based on the bandwidth available. Therefore, as the luminosity 
increases, there is a need to adjust the triggers to keep the rate manage- 
able. This can be done by a combination of increasing the trigger thresh- 
olds and by tightening the selection procedures used to select events. In 
addition, multi-object triggers can be used for some physics channels and 
these can use lower thresholds than single object triggers. 
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In LHCb the amount of data per event is smaller than in the general- 
purpose experiments (ATLAS and CMS), and this allows for a much 
larger LO trigger rate. Until long shutdown 2 (LS2, 2018) this was about 
1 MHz. The LO trigger was followed by a two-level HLT, running on an 
Event Filter Farm (EFF) of 52,000 logical CPU cores [450]. The upgrade 
of the LHCb detector uses a pure software trigger which will be discussed 
below. 

The ALICE experiment [42] at the LHC is designed for the study of 
heavy-ion collisions. The bunch-crossing rate for these type of collisions 
and the instantaneous luminosity are lower and thus the trigger rates are 


lower [321], but the very large track multiplicity results in very large event 


data sizes. 

In the following we will consider two examples of the multi-level 
approach for electron and muon triggers at the general purpose collider 
experiments. We will then look at the upgraded LHCb trigger system in 
which all the data is read out at 40 MHz and pure software triggers are 
used. 


ELECTRON/PHOTON TRIGGER IN ATLAS 


There are important physics channels that involve high transverse momen- 
tum electrons and photons (e.g. pp > tf, t + bev. and pp > H > yy). 

The ATLAS e/y trigger consists of two levels. At Level 1 (L1) a 
pipelined hardware trigger is used to reduce the rate from the bunch 
crossing rate of 40 MHz to ~ 20 kHz [98]. The L1 e/y trigger only uses 
calorimeter information. The trigger is pipelined, so that decisions are 
made every 25 ns with a latency of 2.5 us. In order to minimise the 
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The size of the general purpose LHC 
detectors is such that it takes a highly 
relativistic particle more than one bunch 
crossing (25 ns) from the vertex to the 
outer parts of the experiment. 


For example for H — yy a di-photon 
trigger can be used with lower pr 
thresholds than for single-photon trig- 
gers. 


Figure 13.4: Raw data transfers from 
Level | trigger in LHC experiments 
and a comparison with earlier experi- 
ments [391]. 


The bunch-crossing interval for heavy 
ion collisions at the LHC is 125 ns. 


This rate is ~ 20% of the total LI 
trigger bandwidth. 
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bandwidth required for the trigger, reduced granularity is used at L1 (see 
Figure 13.5). 
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The coarse granularity calorimeter trigger data is read out and pro- 
cessed in dedicated pipelined processors. A ‘sliding window’ algorithm 
is used, in which the transverse energy in a region of 2 x 2 neighbouring 
trigger towers is calculated. The region is stepped over the calorimeter in 
the 7 and @ directions to find local maxima of the energy in the window. 
As the signal involves isolated electrons and photons, powerful rejection 
of backgrounds is achieved using cuts on the energy in the ring of 12 cells 
surrounding the 2 x 2 cluster in the EM and hadronic calorimeters. Addi- 
tional rejection against backgrounds is obtained by cuts on the hadronic 
calorimeter energy behind the 2 x 2 EM cluster. 

The events are read out after an L1 trigger, so that the High Level Trig- 
ger (HLT) can be a software trigger using the full granularity of the detec- 
tor. The L1 e/y trigger seeds a Region of Interest (RoI) centred on the 2 x 2 
window. In order to reduce CPU time the HLT e/y trigger algorithms only 
look for electrons and photons within the RolIs defined by the L1 e/y trig- 
ger. To optimise the CPU requirement, simple algorithms are first applied 
in order to reject some of the backgrounds. For electron candidates with 
pr >15 GeV/c, simple cuts on variables like the ratio of hadronic to elec- 
tromagnetic energy in the cluster. This stage uses fast track reconstruction 
inside the Rol to perform track-cluster matching cuts [98]. 

After these fast algorithms have passed an event with electron candi- 
dates, more sophisticated algorithms are used. The cluster reconstruction 
and calibration used a similar likelihood method to that used in the offline 
analysis (see the discussion of eq. (12.6)). 

The overall electron efficiency can be factorised into the product of 
trigger efficiencies and offline efficiency, £a] = Eig €offine- The trigger 
efficiency is defined as Erig = N (trig) /N (offline), where N (trig) and 
N(offline) are the number of electrons identified at the trigger and offline 
level, respectively. The same type of ‘tag and probe’ analysis is used to 
measure the electron trigger efficiency as used to measure the offline effi- 
ciency (see section 12.7). 


Electromagnetic 
isolation ring 


Hadronic inner core 
and isolation ring 
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Figure 13.5: Schematic of the LI 
e/y trigger showing the coarser 
granularity used for the electro- 
magnetic (EM) calorimeter. The 
granularity is An x Ag = 0.1 x 0.1, 
whereas the granularity for the full 
readout of the calorimeter is much 
finer. For example in the second 
layer for N < 1.4 the granularity is 
An x A@ = 0.025 x 0.0.025 [99]. 


The backgrounds arise from QCD jets 
which have a very high rate, so that 
even if a low fraction of these events is 
misidentified as electrons the result is a 
significant accepted background rate. 


For electron candidates with 

Pr <15 GeV/c the background rates 

are higher and more sophisticated al- 
gorithms based on neural networks are 
used to reduce the rates. This procedure 
uses slightly more CPU time than the 
simple cuts algorithm but improved 

the background rejection by a factor of 
1.5-1.6 [98]. 
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The data show the characteristic turn-on behaviour with a low effi- 
ciency at very low pr and gradually rising to a plateau at high pr. The 
plateau is below 100% because of the need to reject a large fraction of the 
background. The efficiency varies across the detector as it is lower in the 
regions in which there is more material in front of the calorimeter. The 
effect of pile-up also reduces the trigger efficiency. Therefore the trigger 
efficiency is also measured as a function of 7 and the number of pile-up 
vertices, U [98]. 


MUON TRIGGER IN CMS 


CMS uses hardware triggers to reduce the bunch crossing rate of 40 MHz 
to 100 kHz. The rate is reduced to about 1 kHz by the HLT which used 
software triggers running on a CPU ‘farm’ with about 30,000 cores [206]. 
As there are several triggers for the different interesting final states, one 
trigger can only use a fraction of the total bandwidth, which is about 15% 
in the case of the muon triggers. 

The level 1 muon trigger uses ‘trigger primitives’ from the different 
stations, which are track segments in the different layers with an estimate 
of the angular coordinates, @ and @ from the different layers of each sub- 
system. Different hardware trigger systems are used for the three chamber 
types. For the case of the fast RPCs, the track segments are determined 
by a hardware pattern comparator. For the case of the slower DTs, local 
track segments are reconstructed by custom electronics on the chambers. 
This electronics also identifies the bunch crossing. The track segments 
are processed by FPGAs, which synchronise the data and send it via fast 
optical links to the counting room. 

In the counting room, the global muon trigger (GMT) matches track 
segments in the different layers and determines track quality and estimates 
the transverse momentum (pr). It uses a combination of look-up tables 
and FPGAs. The value of pr is estimated from the deflection in ¢ due to 
the solenoidal magnetic field. The GMT combines muon candidate tracks 
from three different algorithms. It resolves ambiguities with overlapping 
tracks by keeping the best quality tracks. 

About 20-25% of the L1 muon triggers do not correspond to genuine 
muons. Therefore the High Level Trigger (HLT) uses a more sophisticated 
algorithm to reject most of this background. The first step is to reconstruct 
standalone muons from the muon chambers alone (L2 muons). The ver- 
tex constraint is added to improve the resolution of the pr measurement. 
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Figure 13.6: Measured electron trigger 
efficiency in ATLAS as a function of 
electron transverse energy (Er) for 
two different algorithms (see text for 
details) [98]. 


A similar data-driven approach is 

used to determine the photon trigger 
efficiency using samples of Z — e+ e` y 
in which the tag is given by the et e~ 
pair with an invariant mass below that 
of the Z and an invariant mass of the 
three-body system compatible with that 
of the Z [98]. 


For an overview of the CMS muon 
system see section 12.8. 


The maximum drift time of the DTs 
of (400 ns) is much longer than the 
interval between bunch crossings of 
25 ns. 


The resolution in pr is limited by multi- 
ple scattering for most of the interesting 
range of pr (see section 12.8). 
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This is implemented in software and so can use essentially the same al- 
gorithms as used for offline muon reconstruction. The L2 muons are then 
used to define a region of interest for the track reconstruction in the inner 
tracker. By matching the muon candidates to the tracker, the pr resolu- 
tion is greatly improved, thus allowing further reduction in trigger rates. 
The much greater precision of the inner tracker is then used to reduce the 
trigger rate by applying a sharper threshold in pr. 

Many physics channels involve isolated muons, whereas the muons 
from backgrounds like 2/K decays in flight will typically result in other 
particles in close proximity to the muon. Therefore powerful background 
rejection can be achieved with isolation cuts. The total pr of additional 
tracks and calorimeter clusters within a cone of AR = 0.3 is evaluated and 
used to reject non-isolated muons. The power of the isolation requirement 
is illustrated in LHC Run 2, for which the trigger threshold for the isolated 
muon trigger was pr = 24 GeV/c, whereas the non-isolated trigger used a 
threshold of pr = 50 GeV/c [207]. 

In a similar methodology as used for electron triggers, the muon trig- 
ger efficiency is measured relative to that of selected offline muons. The 
same type of ‘tag and probe’ analysis is used to measure the efficiency 
from data. 
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LHCB SOFTWARE TRIGGER 


The principle triggers for the LHCb experiment target events with hadrons 
containing b or c quarks. The very large cross-sections (see Figure 13.3) 
make it very challenging to design efficient triggers at high luminosity. 
The triggers for the hadronic decays of b and c hadrons are particularly 
challenging as the identification requires the precision tracking to identify 
the particles that do not point to the primary vertex. 

For the LHC Runs 1 and 2 the LHCb experiment used a conventional 
hardware trigger (LO) and a software selection (HLT). In this period the 
peak luminosity was limited to 4 x 10°? cm~*s~!. One crucial develop- 
ment was splitting the HLT into two levels (HLT1 and HLT2) [4]. HLT1 
performs partial event reconstruction that allows selections to reduce the 
rate to 110 kHz. These events are stored on a large disk buffer for long 
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The CPU time required for the complete 
reconstruction of the full inner tracker 
would be too long for the HLT operat- 
ing using a single processor. Parallel 
processing is achieved by providing 
each processor with separate events. 


AR = \/(An)? + (A@)?, where An 
is the difference in ņ between the 
tracks/clusters and the muon track, 
with an equivalent definition for Ag. 


Figure 13.7: Measured muon trigger 
efficiency in CMS from LHC Run 2 as a 
function of muon pr [206]. 


The rate of the LO trigger was limited to 
1.1 MHz. 
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enough for the updated alignment and calibrations to be determined. This 
allows the CPU farm to apply the final alignment and calibration proce- 
dures. The HLT2 (running on the CPU farm) can then make selections that 
are the same as those that would be made in conventional offline analysis. 
This allows for sharp trigger thresholds to be implemented without loss of 
efficiency and this reduces the rate by an additional order of magnitude. 

For Run 3, the target luminosity for LHCb was increased to 
2 x 10°? cm~*s~! [333]. Using the LO trigger would have required raising 
the trigger thresholds and would have resulted in a large loss of interesting 
physics events. Therefore, for LHC Run 3, a major additional improve- 
ment was the change to a full software trigger [334]. 

All events are read out from the detector at the bunch-crossing rate of 
40 MHz and the trigger is performed by the two-level HLT. This requires 
the use of 8800 Versatile Link transmitters (optical links) using a data rate 
of 4.8 Gbps. The Versatile Links are based on a CERN developed custom 
radiation hard optical transceiver [476]. The rates at the different stages 
are summarised in Figure 13.8. 
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The HLT1 runs on GPUs [5] and can accept an input data rate of 
40 Tbit/s using about 500 GPU cards. HLT1 can perform pattern recog- 
nition and identify particles as hadrons or muons. Crucially for the LHCb 
application it can identify decay vertices of long-lived particles. The 
HLT2 software trigger requires the use of a dedicated CPU farm with up 
to 4000 nodes. This allow for a processing time per event of 13 ms. The 
average event size is 100 kB and accepting events for storage at 20 kHz, 
results in an output bandwidth of 2 GB/s [334]. 

The concept of a pure software trigger was also considered for the 
ATLAS and CMS experiments. These experiments contain many more 
channels than LHCb and therefore the costs would have been very high. 
In addition, the radiation levels in the pixel detectors in these experiments 
are too high for the operation of optical links and therefore copper cables 
are used to transfer the data to a lower radiation region where the optical 


Detectors in Particle Physics 


This corresponds to an average number 
of inelastic interactions per bunch 
crossing of 7.6. 


Figure 13.8: Estimated data rates for 
the LHCb all-software trigger. The 

30 MHz input rate is averaged over the 
filled and empty bunches in the LHC. 
The Low Level Trigger (LLT) is an 
optional feature to implement simple 
selections on individual object pr ina 
similar way to that used in a hardware 
trigger [334]. As for the trigger in Run 
1 and 2, the data output from HLT1 is 
stored on disk long enough to apply the 
final alignment and calibrations. 
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transceivers can be placed. The very large number of these electrical links 
that would have been required by a pure software trigger would result in 
too large a mass of copper. 


13.5 TRIGGERS FOR RARE DECAY EXPERIMENTS 


The trigger requirements for rare decay experiments are very different 

to that from collider experiments. In general, the event rates will be very 
much lower. We will consider the LZ experiment as an example (see sec- 
tion 8.2). The rate of signal events is of course extremely small and possi- 
bly zero. However, in order to control the backgrounds it is essential to ac- 
quire suitable data samples. For example, events outside a fiducial volume 
can be used to extrapolate the backgrounds into the signal region. For the 
LZ experiment the estimated background rate is 40 Hz [34]. Higher rates 
are achieved with dedicated calibration sources such as LEDs for which 
the rate will be up to 150 Hz. For these relatively low rates the data can 
be continually digitised and read into a circular buffer without the need 
for a hardware trigger. Data extractors are used to detect interesting events 
and compress the data which is then read out to temporary buffers. The 
Event Builders then assemble all the data from each event for offline anal- 
ysis [34]. 


TRIGGERS FOR NEUTRINO DETECTORS 


The triggers required for accelerator and non-accelerator neutrino exper- 
iments are different. We will consider the triggers used for the MINOS 
experiment (see section 14.2) as an example of an accelerator neutrino 
detector. The neutrino beam has a 1 us “beam spill’ structure, with a repe- 
tition time of 2 s [374]. 

The pixels of each PMT are digitised and the pedestal is subtracted. 
Within the beam spill time window, all hits above a threshold are read out 
to a trigger processor. The trigger processor implements a simple algo- 
rithm for the far detector to select physics events for permanent storage. 
The hits are time-ordered and clusters of hits within about 200 ns are 
defined. The primary selection algorithm uses clusters within a window 
of 200 ns to identify candidate events. The main trigger algorithm for 
MINOS requires that at least n planes out of any group of m contiguous 
planes should have hits within the time window. The instantaneous rate 
of neutrino interactions at the ‘far detector’ is less than 200 Hz, which is 


much lower than the rate of PMT hits from radioactivity and dark currents. 


The Super-Kamiokande experiment (see section 14.2) is an exam- 
ple for a non-accelerator based experiment. It originally had a hardware 
trigger, which triggered a readout of the experiment when the number of 
PMT hits out of the about 13,000 PMTs in the experiment within a time- 
window of 200 ns exceeded a threshold. Depending on the energy thresh- 
old the rate of triggers was between 10 to 1000 Hz. The signals from the 
PMTs were dominated by the dark counts rate, which was about 4 kHz. 

However, to improve the physics reach of the experiment, and to allow 
for the search for relic supernova neutrinos the energy threshold had to be 
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Data extractors and event builders are 
implemented in software. 


The time structure arises from the 
proton accelerator used to create the 
neutrino beams. 


The far detector is located 730 km 
from the accelerator in order to study 
neutrino oscillations. 

Other algorithms are used for calibra- 
tion and background studies. 


Typical values were n = 4 and m= 16. 


PMT dark currents are mostly made up 
of signals due to thermionic electron 
emission in the PMT. 
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lowered, resulting in higher trigger rates, and the time window had to be 
increased. As a consequence, the trigger windows would have overlapped 
and the detection of the physical event would have been made much more 
difficult. Therefore the readout of the experiment was changed to continu- 
ous readout at a rate of 60 kHz with a software event selection in a farm of 
standard commodity PCs [507]. The total data rate this system can process 
is around 800 MB/s. To verify correct operation of the trigger, high rate 
tests, simulating data from a supernova burst by flashing LED light in the 
water tank (2.5M flashes in 10 s) were performed. 
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For this search the time-window had 

to be increased to more than 200 us 

to suppress the most prominent back- 
ground from atmospheric neutrinos, 
which can be rejected by detecting the 
2.2 MeV gamma ray from capturing the 
recoil neutron from the interaction of 
the atmospheric neutrino. The longer 
time window is necessary because of the 
time until capture. 
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Key lessons from this chapter 


e Coincidences can be used to identify geometrical correlations of signals that are rep- 
resenting interesting event topologies. At the same time, coincidences can reduce fake 
rates from individual detectors. 


e Trigger decisions are binary decisions typically based on a continuous variable. A 
sharp turn-on of the trigger as a function of the continuous variable is beneficial. 


e Ina typical particle physics experiments several types of trigger are employed (dif- 
ferent physics targets, calibration data, redundancy). The total rate of triggers must be 
compatible with the capabilities of the data acquisition system. Triggers with a rate 
exceeding this must be pre-scaled. 


e To reduce readout deadtime, data is buffered locally, usually in a pipeline memory. 


e In experiments with a high rate of background events, multi-level triggers are em- 
ployed, to match at each level the trigger rate, the latency and the local buffering 
capacity. 


e Advances in network technology and processing power of standard commodity PCs 
enable the move towards pure software triggers. 


EXERCISES 


1. Coincidence triggers. 


An experiment uses Np photomultiplier tubes (PMTs) for a trigger. The dark rate in each PMT 
is R. The trigger is defined by the requirement that there should be at least Nyits within a time inter- 
val of AT. Explaining any assumptions you make, what is the rate of triggers from coincidences of 
the background? 


2. Trigger purity. 


A secondary beam at an accelerator is produced by collisions of a proton beam with a target. 

This beam has a fixed momentum and is known to consist of 90% a* and 10% K~. A threshold 
Cherenkov counter is used to select K+. The efficiency for the counter to trigger on z+ (K+) is 
99% (1%). How can the signal from the Cherenkov detector be used to select K*? What would the 
purity (fraction of K*) be for the triggered events? 


3. Muon lifetime measurement. 


A scintillator setup consisting of three scintillators is used to measure the lifetime of cosmic 
muons with a TDC. The cosmic muon comes to a stop in Sı and decays to an electron (and two 
undetectable neutrinos). 
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a) Find logical expressions describing coincidences providing a robust start and stop signal for 
the TDC. 


b) Ifthe rate of cosmic muons is about | per second, and the hit rate from the PMTs is | kHz 
(mostly due to thermal noise), how long would the duration of the scintillator signals need to 
be to keep the rate of fake coincidences at an acceptable level? 


c) Why is it sensible to give the inverted signal used in the anti-coincidence a longer duration? 
4. Trigger rates. 


a) Derive eq. (13.3). 

b) Consider a trigger system using N buffers to store the data while a trigger processor deter- 
mines whether to keep or reject the event. Let the mean arrival rate of triggers be A and the 
mean trigger processing time be u. The arrival times of the events and the processing times 
are distributed exponentially. Write down the probability that there was one event in the 
queue at time ¢ + dt if there was one event in the queue at time t. Hence show that for equi- 
librium for the case n Æ 1 


UPh+t +APi-1 = (A +u)P, = 0, 


and for n = 1 
APo = uP 
c) Using the results from b) show that P, = p™ Po, where p = À /u. 


d) The buffer is full and there is deadtime if n = N. Solve for Py (i.e. the deadtime fraction) by 
requiring that Ey P,=1. 


e) For the case p = 0.3 what is the deadtime fraction with no buffers. How many buffers would 
be required to reduce the deadtime fraction to be <5%? 


4 Detector systems and 
applications 


In this chapter we will pull together all that we have discussed in previous 
chapters, and see how different detector technologies can work together 

to combine into powerful experiments that are optimised for physics mea- 
surements, but can also suppress backgrounds efficiently. We will also 
discuss some of the challenges the environment of the experiments and the 
integration of the different systems impose. 


14.1 COLLIDER DETECTORS 


Modern colliders and their associated experiments constitute a large in- 
vestment, financially and in time and effort. To maximise the scientific 


output, general purpose collider experiments are designed for a broad pail aioe ani aa 
: , oa ‘ ‘arget more specific physics, e.g. 
range of tasks, which also gives the capability for internal cross-checks which is optimised for heavy flavour 


and complementary and mutually supporting measurements. These detec- —?"ysies (b and c-hadrons). 
tor systems are designed to have good resolution for the kinematic prop- 

erties of electrons, photons, muons and jets, as well as missing transverse 

energy. In addition, there is a need to identify jets originating from b or c 

hadrons (see section 12.12) and T (see section 12.9), and detached vertices 

from the decay of short-lived particles. 

Considering the different amounts of material in different detector 
technologies, the effect of upstream material on their measurement, the 
different typical channel density and their cost per unit area, there is an 
obvious sequence of detector technologies a particle will be presented 
with as it makes its way from the interaction point (IP) through the col- 
lider experiment. 


Figure 14.1: A slice of the CMS detec- 
tor, showing schematically the response 
to different particles [115]. 


Electromagnetic 


Calorimeter 
Hadron 
Calorimeter Superconducting 
Solenoid on 
Muon Electron —— Charged hadron (e.g. pion) 
===- Neutral hadron (e.g. neutron) — ----- Photon 


The innermost detector layers typically have the task to reconstruct 
primary and secondary vertices. As some short-lived particles, for ex- 
ample mesons containing c or b quarks, or taus have short proper life- 
times (ct < 1 mm), these layers should be as close as possible to the 
primary interaction. However, the radius of these vertex detector lay- The LHCb detector is an exception in 
ers is normally limited by the beam pipe, the track-density resulting ina 1⁄4 the VELO detector can be inserted 


inside the beam pipe 
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high hit-occupancy and the high background particle fluences close to the 
beam, which cause radiation damage in the vertex detectors. To achieve 
the goal of good vertex reconstruction in very high track densities these 
detectors must be highly segmented, making semiconductor pixel detec- 
tors the obvious choice for these detectors. The detectors should also con- 
tain very little scattering material. Radiation-hard techniques are a must 
for hadron colliders, for example 3D sensors. 

The vertex layer is then surrounded by further tracking layers. The 
combined tracking detector measures the momentum of the charged par- 
ticles in a magnetic field. The standard magnet configuration for tracking 
detectors is a solenoid, as it gives good uniform pr resolution. To achieve 
a good momentum resolution, a high magnetic field and a long lever arm 
are required. In hadron collider experiments, where event rates and track 
densities are high, fast and accurate position detectors are required, mak- 
ing currently silicon strip detectors the most capable detection technology. 
However, progress in MAPS detectors and other semiconductor technolo- 
gies will bring pixelated readout into the tracker in future experiments. In 
experiments where fast response is not a concern, like in heavy-ion or lep- 
ton colliders, gaseous TPCs are used, as they provide a large number of 
space points, often with dE/dx information, with minimal scattering ma- 
terial in the tracking volume, albeit with a poorer position resolution per 
point. 

Depending on the physics priorities of the experiment, either as part of 
the tracking detector or as dedicated layers just after the tracker, particle 
identification detectors (see chapter 12) can be employed. 

Outside of the tracker, the calorimeters are placed, usually divided into 
an inner electromagnetic part, and an outer hadronic calorimeter. They 
are the key detector system to detect neutral particles, and they are critical 
for the determination of the missing transverse energy. To achieve the lat- 
ter, the calorimeters have to extend up to a high pseudorapidity, typically 
|n| ~ 5. In practice it is very difficult to extend the coverage beyond this 
range because the calorimeters have to be outside the beam pipe. 

A key design consideration in every collider experiment is the position 
of the solenoid coil for the central tracking system. If it is placed before 
the calorimeter its material will degrade the energy resolution. One way 
to compensate for this is the use of a pre-shower detector. If an electron 
interacted in the material upstream of the calorimeter it will result in a 
large signal in the pre-shower detector which can be used to compensate 
for the energy lost in the passive material in front of it. 

Placement of the calorimeter inside of the solenoid coil increases the 
size and cost of the magnet. To limit this, the calorimeters have to be very 
dense. In addition, their readout has to work in the strong magnetic field of 
the central tracker. 

In any case, the field lines of the magnet must be closed. To prevent 
excessive stray fields, closure of the field lines is usually accomplished 
by an iron return yoke, which can be used as the absorber of the hadronic 
calorimeter (as in ATLAS [85]), or for a solenoidal bending field for the 
muon system (as in CMS [196]). 

The muon detection system is located outside the calorimeters. Due 
to the large size and the moderate track densities even at hadron colliders, 
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The beam pipe might also require a 
thermal insulation layer to allow for 
bake-out of the beam pipe to remove 
surface contaminants that could de- 
grade the beam pipe vacuum. The 
degradation of the vacuum occurs when 
these surface contaminants are hit by 
stray beam particles. 


LHCb [332] differs from the layout of 
a standard collider experiment, as it is 
optimised for the detection of particles 
in the forward direction and thus it does 
not have a solenoidal spectrometer, but 
a dipole. 


There is a trade-off in cost and perfor- 
mance which will determine at which 
radius there is a switch between pixel 
and strip detectors. 


The differences between electromag- 
netic and hadronic calorimeters can 
be in the construction (homogeneous 
vs sampling), choice of absorber Z 
and/or segmentation. Particles don’t 
distinguish between them. They start a 
shower by the interactions they undergo 
in the material presented. Typically 
about half the energy of an incident 
hadron is deposited in the electromag- 
netic calorimeter. 


Another motivation for a finely grained 
pre-shower detector is the identification 
of single photons in the presence of 
backgrounds from n° — yy which is 
crucial for the study of the Higgs decay 
H > yy. 
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gaseous detectors are used. Precision layers are complemented by trigger 
layers that are faster but have a coarser granularity. 

Triggering is a critical issue for every collider detector, but in particu- 
lar for experiments at hadron colliders (see chapter 13). 


SERVICES 


The aim is to have a solid angle cover- 
age as close to 47 as possible. There- 
fore, general purpose collider detectors 
are sometimes referred to as 4% detec- 
tors. 


The goal for the overall layout of a typical 47 collider detector is a com- 
pact integration of all the subsystems while maintaining full hermeticity. 
One key consideration in this are the services that are required to operate 
the detectors, as they constitute significant dead material, and the services 
for the inner layers need to penetrate the outer layers. 

Typical services in a collider detector are: 


e High voltages for drift and amplification fields, bias voltages, etc. 
The voltages typically range from about 100 V for semiconductor 
detectors to a few kV for wire chambers, with currents that are typ- 
ically low (<A). It is typically sufficient to segment the supply 
for larger sections of the detector, with internal distribution, and the 
typical number of cables for a system is between 10? and a few 10°. 


e Electrical power for front end-electronics, with a typical supply 
voltage between 1 and 5 V DC. In particular if the front-end elec- 
tronics has to provide digitisation and buffering, supply currents 
can be very high (several kA), requiring large conductors. For high 
power inner systems like semiconductor trackers improved power- 


ing techniques are thus developed, which provide the power with 
reduced current by increasing the voltage in the supply cables. Two 
strategies are pursued: Serial powering or DC-DC conversion. Be- 
cause of the high power in each channel and the need to decouple 
individual modules, typically high granularity of the supply is re- 
quired (10° to 10* cables per system). 


Gases for gaseous detectors or environmental gas supply. These are 
supplied through tubes and hoses, the size of which limits the num- 
ber to 10 to a few 100 per system, typically with further distribution 
inside the detector. 


Cooling. There are two main needs for cooling in a detector. First, 
all the heat generated by detectors and front-end electronics needs to 
be removed from the detectors. This can be a substantial amount 
(several 10 kW for typical semiconductor systems at LHC). For 
high-power systems this requires liquid or even evaporative cool- 
ing (see section 9.6), whereas in low power detectors, for example at 
lepton colliders, gas cooling might be sufficient. 


The second need for cooling is the supply of liquids to cryogenic 
detectors and superconducting magnets. These are typically cryo- 
genic liquids like LN, or LAr. 


Cooling lines typically need to be thermally insulated to prevent 
condensation or formation of ice. Their size limits their number to 
typically 10 to a few 100 per system, with further distribution inside 
the detector. 


In serial powering (see for exam- 

ple [265]) loads are connected in 
series like the bulbs in a Christmas 
tree lighting system. In DC-DC con- 
version systems (see for example [26]) 
the voltage is reduced by storing the 
input energy temporarily in a coil or 
capacitor, and then switching to release 
that energy to the output at a different 
voltage (see exercise 3). 


In addition to heat removal capacity 
(power) there might also be require- 
ments for the temperature of the coolant 
(for example to maintain thermal stabil- 
ity for radiation-damaged semiconduc- 
tors). 
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e Detector control system (DCS), sometimes also called ‘slow con- pa a experiments, see 
trol’) for monitoring of detector system parameters (T, V, J, etc.) 
and control (setting parameters in the front-end electronics). These 
tasks are typically (but not necessarily) slow and the signals are low 
voltage/low current. Often these systems are multiplexed, so that a 
few 10 to 100 lines are sufficient. For safety-critical tasks interlock 
lines (monitoring and control) are needed. For high reliability these 
might require individual hardware lines, which can result in signifi- 
cant numbers (several 1000) in large systems. 


e Data readout. Highly segmented detector systems like pixel detec- 
tors can have up to several 10? channels. For such large channel 
numbers on-detector data reduction is required (zero-suppression, 
multiplexing, etc.). Electrical lines have limited bandwidth and rel- 
atively high mass, and thus optical links are often used at speeds For a review of opto-electronics tn the 
i LHC experiments see [476]. 
of several hundred Mb/s to a few Gb/s. As the power of data lines 
is low, electrical read-out lines can be small, but many are needed 
(103 to 10°). Optical lines can be fewer after multiplexing, but they 
are more delicate, and need proper protection. The opto-electronic 


conversion introduces additional complexity to the readout system. Opto-electronics components are gen- 
erally not sufficiently radiation-hard 


for the use at the inner radii of LHC 


RADIATION ENVIRONMENT AND ACTIVATION experiments. 


In particular at hadron colliders the radiation environment is a major de- 

sign driver. The radiation fields in the LHC experiments are dominantly 

produced by beam collisions, with contributions from beam-gas interac- The neutrons arise predominantly 

ti doth hi l Th t t d the b : from spallation in the calorimeters. 
ions and other machine losses. They are strongest around the beam pipe, The energy of the neutrons can be 

in particular close to magnets and collimators, but there is also a more uni- reduced by elastic scattering to an 

f t duet tr energy below threshold for causing 
orm component due to neutrons. radiation damage. In practise this is 

achieved with polyethylene moderators. 
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Figure 14.2: Total ionising dose in Gy(Si) (left) and 1 MeV neutron equivalent fluence in silicon (right) from GEANT4 simulations of the ATLAS 
detector, shown for a quadrant of the detector (beampipe along the horizontal axis) [101]. Note the increased radiation levels close to the beam pipe, 
leakage along the service gap between the barrel and the endcaps (at z = 350 cm), and increased hadron fluences due to the showers in the dense 
material of the endcap calorimeter (370 cm < z < 600 cm). 


Effects of radiation backgrounds are 


e Increased particle rates in the detector. This leads to increased oc- 
cupancy and deadtime. The former creates challenges for pattern 
recognition and separation of signals, while the latter limits data 
taking rates. 
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Radiation damage to detectors. In semiconductor detectors this 
leads to bulk damage and consequently increased leakage currents, 
to type inversion and charge trapping which leads to a loss of sig- 
nal (see section 9.11). In wire chambers this leads to creation of 
polymers, and thus increased leakage current and loss of gain (see 
section 7.7). In scintillation detectors it leads to loss of transparency 
and thus signal (see section 5.6). 


Damage to front-end electronics. This can be minimised using mod- 
ern deep sub-micron technologies and appropriate design tech- 
niques. 


Radiation damage to materials. The very high radiation levels at 
hadron colliders can also also cause very significant effects on pas- 
sive material. For example all plastics will deteriorate with radiation 
damage and it is essential to select an appropriate plastic for a given 
application. Similarly some adhesives are much more radiation tol- 
erant than others. Another example is the increase in attenuation in 
optical fibres due to ionising radiation. Custom radiation-tolerant 
fibres have been developed and are used in very high radiation envi- 
ronments. 


Single event effects (SEEs) in electronics (see section 3.9). 


e Activation of detector components. This makes personnel access for 


maintenance or replacement of parts progressively more challenging 
with time. This, together with the compact designs of detectors and 
services, and the need for environmental barriers, can make access 
to inner parts of detectors at hadron colliders prohibitively difficult. 
Highly reliable technologies and redundancy are therefore a major 
design requirement. 


Dealing with the consequences of the radiation environment often re- 
quires compromises in the detector design. Sometimes this involves a di- 
rect trade-off between the performance of the detector and its successful 
long term operation in the radiation environment. The demands of design- 
ing a detector that can sustain radiation damage can drive requirements for 
the environment, like for cooling of silicon detectors, and can influence 
the choice of detector technology and segmentation. 

Due to the high importance of the radiation effects in collider de- 
tectors at the LHC, reliable predictions of the radiation fields have been 
an essential prerequisite for the design of these detectors. As the dom- 
inant source of the radiation is from beam-beam interactions, the start- 
ing point of these predictions are Monte Carlo event generators such as 
PYTHIA8 [458, 459] and DPMJET-II [430]. The particles originating 
from the proton-proton collisions interact with the material in the detector 
and the accelerator (collimators, focusing magnets, etc.), causing electro- 
magnetic and hadronic showers which give rise to the complex radiation 
fields. For the prediction of these, Monte Carlo transport software like 
GEANT4 [29, 44, 45] or FLUKA [120, 151] are being used. 
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Data on radiation hardness of different 
materials has been compiled in a series 
of CERN technical reports [471, 470, 
134, 340]. 


Commercial products are not designed 
to be radiation tolerant. Companies 
can change processes in subtle ways 
which affect the radiation tolerance. 

It is thus important to perform tests to 
verify radiation hardness, also during 
production on batch samples. 


Heavy metals can undergo radioactive 
transmutation to radioactive isotopes. 
Therefore the use of metals like gold 
and silver should be minimised. 


For example limiting ageing effects in 
wire chambers will require reducing the 
gas gain. 


For a detailed review of radiation 
environments, their prediction, and the 
effects of radiation for LHC detectors 
see [219]. 
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14.2 NEUTRINO EXPERIMENTS 


For experiments in neutrino physics there are very different types of de- 
tectors, depending on the energy of the neutrinos to be studied. In general, 
the weak interaction neutrino cross-sections are very small, so large de- 
tectors are required. For experiments studying neutrino oscillations over 
very long lengths it is necessary to use a ‘near’ detector (i.e. close to the 
production target) to measure the flux and a ‘far’ detector to measure neu- 
trinos after they have oscillated. The rates in the near detector will be large 
but the rates in the far detector will be very low, so in general very differ- 
ent detector types will be used for near and far detectors. The far detector 
must be capable of identifying the neutrino flavour to be sensitive to neu- 
trino oscillations. 

For low energy neutrinos, there will be significant backgrounds from 
radioactivity and cosmic rays, and therefore similar precautions will be 
required to those used in the experiments searching for rare events (see 
section 14.3). 

The detection of low energy (MeV) anti-neutrinos uses inverse p de- 
cay, Vep — e*n. The positron will annihilate with an electron in the detec- 
tor, ete” — 2y, and the ys can be detected in a scintillator. The neutron 
will be captured by a nucleus leading to an excited state that decays radia- 
tively to the ground state. This process occurs on a longer time scale (in 
the order of us), and therefore the signal for an Ve interaction is a prompt 
signal, followed by a delayed signal. 

A modern example of this approach is provided by the Daya Bay ex- 
periment [226]. To capture the neutrons, gadolinium is added to the liquid 
scintillator. This has two advantages: 


e the large cross-section for neutron capture by Gd reduces the 
timescale for this process and thus reduces the backgrounds from 
random coincidences; 


e the radiative decay of the excited Gd nuclei results in an 8 MeV 
y signal, which is a higher energy than that produced by radioactive 
backgrounds. 


The Gd-liquid scintillator is contained in a transparent acrylic vessel to 
allow the scintillation light to be viewed by photomultipliers. The liquid 
scintillator must be chemically compatible with acrylic and stable over a 
period of years. Linear alkyl benzene (LAB) was chosen as the scintillator 
because of its good scintillation yield and compatibility with acrylic. 


WATER CHERENKOV NEUTRINO DETECTORS 


A different approach to the detection of neutrinos is to use very large wa- 
ter Cherenkov detectors. These are very large vessels filled with pure wa- 
ter and viewed with an array of photomultipliers. The experiments are 
typically located in an underground cavern to provide shielding against 
cosmic rays. 

The Super-Kamiokande (SK) experiment [254] uses 50,000 tons of 
ultra-pure water in a cylindrical steel vessel. The Cherenkov light from 
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Requiring the capture signal can be 
used to give a very big reduction in 

the radioactive backgrounds and this 
technique was used in the discovery of 
the Ve, where the neutron was captured 
by cadmium nuclei in the scintillator 
solution of the detector [421]. 


This experiment measured the Ve fluxes 
at different distances from a nuclear 
reactor, in order to study neutrino 
oscillations. 


Acrylic has lower radioactivity than 
glass. 


Earlier experiments had problems with 
the scintillator turning yellow. The 
resulting attenuation length decrease 
rendered the detectors inoperable. 


The chemical structure of LAB is 
CoHsCpAon41, with n in the range 
from 10 to 13. 


An even larger detector using these 
principles (HyperK) is planned for 
the study of CP violation in neutrino 
oscillations [344]. 
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relativistic particles is detected by an array of PMTs. As discussed in sec- 
tion 12.4, the pattern of hits in this array is given by the intersection of the 
Cherenkov cone with the detector array. 

The volume of the SK experiment is divided into an on Outer Detec- 
tor (OD) and an Inner Detector (ID). The ID and OD are optically sepa- 
rated by a stainless steel framework. The ID is the actual fiducial volume 
of the experiment. It is instrumented with 11,146 hemispherical PMTs 
with 50 cm diameter. This provides an effective area coverage with pho- 
tocathode surfaces of 40%. The purpose of the OD is to detect and veto 
incoming charged particles. It is instrumented with a smaller number of 
PMTs. As there is a lower area coverage of the PMTs in the OD two tech- 
niques are used to increase the light collection efficiency. Firstly, wave- 
length shifter (WLS) plates are used to shift the UV light to the blue/green 
region, for which the quantum efficiency of the PMTs is higher. Secondly, 
a reflective liner is used on the outer walls, which spreads the light over 
several PMTs. This degrades pattern recognition, but the increased light 
yield improves the energy resolution. 

In order to minimise radioactive backgrounds and to ensure a very 
long attenuation length for the Cherenkov light, the water is purified in 
multiple purification systems. For calibration a variety of systems are 
used. These include a system with a laser and CCDs to monitor the trans- 
parency of the water. A xenon light is used to measure the relative gains 
of the PMTs. A nickel source is used to produce 9 MeV photons from ther- 
mal neutron capture on nickel using the reaction BNi(n, )°Ni [11]. The 
activity of the source is such that the probability of detecting two photons 
is less than 1%. The resulting spectrum shows the clear peak from single 
photoelectrons. Finally, a low energy linear electron accelerator is used to 
inject electrons of precisely known energy. 

In addition to the array of hardware calibration systems, in-situ cal- 
ibration procedures were used based on through-going muons, stopping 
muons and the resulting decay electrons (U~ — e Vy, Ve). The energy res- 
olution for events with a single muon ring was around 2% for events with 
a visible energy around | GeV [294]. The energy resolution for low energy 
electron neutrinos is [10] 


Ox /E = —0.123/(E/MeV) + 0.376/.\/E/MeV +0.0349. 


TRACKING CALORIMETERS 


For higher energy (multi-GeV) neutrinos produced from accelerators, 
detectors with a large amount of material are required to obtain a sig- 
nificant signal rate. Therefore a typical approach has been to use an 
iron/scintillator calorimeter. In such a detector, electrons from v.X — e Y 
can be identified from the electromagnetic showers in the calorimeter. The 
muons from VX — u` Y can be identified by penetrating tracks that do 
not shower. In addition, if the iron is magnetised, the momentum of the 
muons can be reconstructed from the curvature of the tracks. 

We will consider the MINOS experiment [371] as a modern example 
of this approach. The MINOS detector used alternating planes of scin- 
tillating strips and 2.54 cm of steel. The scintillating strips were 4.1 cm 
wide, | cm deep and up to 8 m in length. In order to minimise the effect 
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In 2002, during filling of the tank with 
water, one PMT imploded, which then 
caused a shock wave that damaged 
further PMTs and created a cascade 
which resulted in over half the PMTs 
being destroyed. These PMTs were 
replaced and protection was introduced 
to prevent such an incident in the future. 


As the OD is used as a veto detector, 
this is an overall performance improve- 
ment. 
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Figure 14.3: Charge signal spectrum 

(single photoelectron peak at about 

2.5 pC) measured with the Super- 

Kamiokande PMTs using the *8Ni 

y source [11]. 


The first term which represents the noise 
has an unphysical negative value but 
this is an artefact of the fit. 


This is required for long baseline neu- 
trino oscillation experiments. 


For these type of experiments, the 
calorimeter serves as the target and the 
detector. 


If the muon stops inside the detector, 
the energy can be more precisely deter- 
mined from the range. 


See also section 12.11. 
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of attenuation for these very long strips, the scintillating strips were read 
out by wavelength shifting (WLS) fibres. The steel was magnetised by 
toroidal coils, providing an average magnetic field of 1.42 T. 

The resolution for the muon momentum measurements was dominated 
by multiple scattering, and was Ap/p = 12% for the typical muon mo- 
menta in the experiment. In addition, for low energy muons that stopped 
in the detector the momentum could be determined from the range of the 
track. 

The resolution for electromagnetic showers was limited by the thick- 
ness of the steel plates. Improved resolution could have been achieved 
with thinner plates, but for a fixed detector budget this would have 
resulted in a lower target mass and hence a lower number of signal 
events. The calorimeter energy resolution for electron showers was 
Og /E = 24%/\/E/GN ©4%/(E/GeV) and for hadron shower measure- 
ments og /E = 56%/\/E/GeV ©2%. The total mass for the ‘far’-detector 
was 5400 tons. 

Several calibration systems were required to achieve this performance. 
These included a UV LED system to inject light into the WLS fibres. This 
was used to monitor the uniformity and stability of the PMTs and associ- 
ated electronics. In addition to the electronic and optical calibration sys- 
tems it was essential to make an accurate calibration of the energy scale 
for electromagnetic and hadronic showers. This was done using a smaller 
calorimeter made with the same steel and scintillator technology. This 
calorimeter was exposed to beams of low energy electrons, muons and pi- 
ons to determine the absolute energy scale. The cross-calibration with the 
MINOS detectors was performed using signals from cosmic ray muons. 

A new approach to large mass neutrino experiments is using a liquid 
argon TPC as described for the DUNE experiment in section 8.1. This 
provides very good tracking as well as calorimetric information. 


14.3 PARTICLE DETECTORS FOR RARE EVENTS 


Examples for experiments targeting rare events are searches for dark mat- 
ter or neutrinoless double B decay (Ov 8). The general requirements for 
these type of experiments are: 


e A large active mass and a long running time; 
e High efficiency for detection of the signal event; 


e Backgrounds must be low enough so that they do not obscure the 
search signal. 


Sometimes it is possible to perform useful searches even if the back- 
ground rates are larger than the expected signal rates, if the background 
rates can be accurately calculated. However, in this case the sensitivity 
will be limited by Poisson-statistical fluctuations in the number of back- 
ground events. Thus the sensitivity will be proportional to \/Mtarget X t, 
where Mtarget is the number of target atoms and f is the duration of the ex- 
periment. On the other hand, if the background rates can be significantly 
lower than the signal, the sensitivity will be proportional to Marget X t; 

The first way backgrounds can be reduced is by placing the detectors 


in deep underground laboratories, which provides shielding against cosmic 
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The WLS fibres absorb light around 
420 nm and re-emit light at 470 nm, 
which greatly increases the attenuation 
length. Outside the active detector 
region the light is coupled to clear 
fibres as these have an even longer 
attenuation length. 


A similar but smaller detector was 
located near to the accelerator neutrino 
source. 


The exposure was performed at a 
CERN PS test beam. 


Conventional particle detector searches 
for dark matter generally target WIMP 
dark matter. Weakly Interacting Massive 
Particles (WIMPs) are a candidate for 
the dark matter in the Universe. They 
would interact with ordinary matter 
with a strength similar to the weak 
interaction and would have masses of 
@(100 GeV). Very different technologies 
are required to search for other possible 
types of dark matter. 
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rays. Even with this shielding there will be a significant flux of muons. If 
the active detector is surrounded by a muon detection layer, then candidate 
events caused by incoming muons can be vetoed. The simplest method is 
to use large area scintillators coupled to photomultiplier tubes to detect the 
muons. Alternatively, if the active detector can provide 3D spatial infor- 
mation, then events near the outer surface of the detector can be vetoed. 

In addition, great care must be taken to minimise the radioactivity of 
all materials used in the construction of the detector, including the cables. 
Other techniques for reducing backgrounds are more specific to the par- 
ticular combination of the event topology of the search and the employed 
detector technology, 

These experiments are extremely challenging in very different ways to 
collider and neutrino experiments and a very diverse range of approaches 
are being developed. A few examples will be considered here. 

An example for a rare decay search is the Majorana experiment [52]. 
The characteristic signature searched for is a double-B decay without the 
emission of neutrinos, 7X > z 2YB P. The standard model background 
is conventional double-B decay 4X > 7 exp P VeVe. The Majorana ex- 
periment uses Germanium enriched in the isotope Ge which undergoes 
double-B decay with a half life of Ty jo = 1.78 x 107! years. 

Apart from eliminating cosmic rays and radioactive backgrounds, 
we need to separate the signal from conventional double-f decay. The 
combined energy of the BB will be equal to the Q-value of the reaction 
for the signal, whereas it will show a broad distribution between 0 and 
the Q-value for the background. Therefore very powerful background re- 
duction can be achieved if sufficiently good energy resolution can be ob- 
tained. 

The Majorana detector is optimised for good energy resolution, which 
can be achieved by semiconductor detectors. The energy resolution at low 
energy of a Ge detector is limited by the electronic noise, therefore we 
want a large detector with a minimum capacitance (see section 3.4 for 
the discussion of how electronic noise varies with capacitance). This is 
achieved with a geometry called a point contact (PC) detector. 

The point contact geometry has been originally proposed for n™ con- 
tacts [346]. If we approximate the contact as a hemisphere of radius r then 
the capacitance is given by C ~ 27€,€gr. For germanium and with a real- 
istic value of r of 1 mm, this gives a capacitance of about | pF, which is 
much smaller than in conventional Ge gamma spectrometers. 

If there is a uniform impurity concentration, then as the bias voltage 
is increased and the detector is depleted, there would be a radial electric 
field, such that electrons would drift towards the axis and not towards the 
nt electrode. The weighting field will only be large near this electrode, 
so only a small signal will be induced. We need a way to alter the elec- 
tric field to give it an axial component. This can be achieved with an axial 
impurity gradient of the doping. 

In the Majorana detector central p* contacts are used instead of n*, 
for two reasons. First, the ~0.5 mm thick n* layer generated by drifting 
lithium into the lattice covers most of the outer surface of the detector, 
resulting in lower sensitivity to very low energy minimum-ionising back- 
grounds (surface betas, x-rays, etc.). Second, with central n* contacts this 
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This experiment is designed to search 
for Majorana neutrinos (a neutrino that 
is identical to its anti-neutrino). 

The detector must contain an appropri- 
ate isotope for which double-B decay is 
energetically possible. 


m 
n 

Figure 14.4: Schematic cross-section 

of a cylindrical point contact detector 

showing the electrical contacts. 


This can be achieved as a result of the 
segregation of impurities that tends to 
occur during crystal growth. 
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geometry suffers from trapping of electrons as they travel over the long 


distances through the crystal to the n* electrode in that geometry, severely 


degrading the energy resolution [346]. p-type detectors suffer less from 
this effect. 

The detected signal is corrected for the differential non-linearity. The 
energy scale is determined from the y peaks from radioactive decays. The 
FWHM in a reduced size demonstrator for the Majorana experiment was 
2.53 + 0.08 keV at an energy of 2039 keV [52]. 

Apart from the excellent energy resolution, detectors with this geome- 
try can identify backgrounds with multiple interactions (for example mul- 


tiple Compton electrons from y backgrounds). As nearly all the signal will 


be generated near the p* electrode, where the weighting field is large, the 
signals from spatially separated electrons will result in several pulses, as 
opposed to the signal which should be a single pulse. 


(ns) 
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The demonstrator consisted of 58 high 
purity Ge detectors with 14.4 kg of 
natural Ge detectors and 29.7 kg of 
detectors enriched to 88.1+0.7% "Ge. 
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Figure 14.5: p-type point contact (PPC) detector. Left: Electric field intensity (color pattern), drift trajectories (black lines), and drift time isochrones 
(grey) in the PPCs for the Majorana demonstrator [12]. Middle and right: Effect of electrode geometry on the pulse shape measured with an oscillo- 
scope for a multiple-site gamma interaction [111]. Middle: conventional geometry. Right: PPC detector. The top trace in each display corresponds 

to the preamplifier output, the bottom trace is the output of a timing filter amplifier (TFA) set at 10 ns differentiation and 10 ns integration time 


(essentially a bandpass filter). 


In order to minimise external backgrounds the Majorana experiment 


also includes different layers of passive shielding. The innermost shielding 


layer uses ultra-pure copper. Additional shielding is provided by further 
layers of copper, lead and boroated polyethylene (used to moderate the 
neutrons, i.e. reduce their energy by elastic scattering on hydrogen atoms, 
and to absorb neutrons). The detector is surrounded by an active muon 
veto system. 


COMBINED READOUT 


The signature of an interaction of one dark matter candidate, a weakly 
interacting particle (WIMP), with ordinary nuclei is the recoil of the nu- 
cleus. Typical background events in these experiments are caused by 
electrons and ys. Signal and background can be distinguished in a sub-K 
calorimeter if the calorimeter measurement is complemented by another 
measurement (scintillation or ionisation), because the yield for the com- 
plementary signal is significantly different for the same amount of heat 
deposited. Typically, less than 40% of the ionisation (in semiconductor 
detectors) and less than 10% of the scintillation light (in scintillating crys- 
tals) produced by electrons or photons is observed for nuclear recoils with 
the same heat deposit. 


The copper was produced underground 
to minimise cosmogenic activation. 


The isotope 10B, which has a natural 
abundance of 20%, has a large cross- 
section for neutron capture for low 
energy neutrons. 


The energy resolution from these signals 
is considerably worse than for the heat 
measurement, as the energy required 

to produce an electron/hole pair or 

a scintillation photon is larger. For 

the same reason, the resolution of the 
measurement of ionisation charge is 
better than for scintillation. 
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The EDELWEISS experiment [68] uses 36 bolometers made of 


germanium, each with a mass between 820 and 890 g, operating at 18 mK. 


At this temperature the nuclear recoils cause an increase of temperature 
of roughly 1077 K per keV. The temperature of each bolometer is mea- 
sured with two NTD germanium sensors. Ionisation charge is read out by 
200 nm thick Al electrodes that are deposited on the surface of the germa- 
nium crystal in the form of annular concentric rings 150 um wide with a 
2 mm pitch. The germanium crystals are biased by a small potential dif- 
ference applied to electrodes at opposite faces of the crystals. The signal 
is measured by the integrated charge induced by the drifting electron-ion 
pairs. Consecutive electrode rings are operated at different bias voltages to 
separate the volume close to the surface of the crystal, for which the ioni- 
sation signal can be reduced by surface effects like charge trapping, from 
the bulk volume. 
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As the electrons and holes drift in the electric field they acquire energy 
from the field, which is ultimately converted into phonons. Therefore the 
measured phonon energy Ey is larger than the actual nuclear recoil energy 
E; that we want to measure, 


Ep =E + VbiasQ, 


where Vpias is the voltage difference of the electrodes on the two sides of 
the detectors and Q is the ionisation charge. For a large Vpias, the addi- 
tional phonon signal will obscure the phonon energy from the nuclear re- 
coil, and we will lose sensitivity. 

An example for a combined heat/scintillation readout calorimeter is 
CRESST III [8]. Here the detector consists of CaWO, crystals that have 
a size of 20 x 20 x 10 mm? and a mass of about 24 g each. The crystal 
is held by ‘sticks’ made of the same material. The temperature readout 
is provided by Transition Edge Sensors (TES). The CRESST-III detector 
used a thin tungsten film (thickness 200 nm), operated at around 15 mK. 
The very small signal can be read out by very sensitive Superconducting 
Quantum Interference Detectors (SQUID). 

The scintillation light is detected by a bolometer, made of a 4 mm 
thick square silicon-on-sapphire wafer of 20 mm edge length, also held 
by CaWO; sticks and equipped with a TES. 


(14.1) 
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The lower Z value for silicon and ger- 
manium improves the sensitivity to 
low mass WIMPs compared to the use 
of xenon in the LZ detector (see sec- 
tion 8.2). 

For a discussion of NTDs see sec- 
tion 11.2. 


Figure 14.6: EDELWEISS III detec- 

tor [68]. Left: Cross-section through a 
detector. Shown are the electrical con- 
nections with interleaved electrodes at 
different potential, and the drift lines of 
the electrons in the surface (dark) and 
bulk (light) regions. Right: Ionisation 
yield versus recoil energy for a neutron 
calibration using an AmBe source. 

The two red (blue) solid lines delimit 
the 90% C.L. nuclear (electron) recoil 
band. Purple dashed lines correspond 
to inelastic scattering of neutrons on the 
first (13.28 keV) or the third (68.75 keV) 
excited state of Ge. 


In practice this means that Vpias should 
be as low as a few Volts. As the detec- 
tor is operated at temperatures below 
50 mK the intrinsic carrier concentra- 
tion is extremely low, so the leakage 
currents in the silicon (germanium) 
crystals are sufficiently low that there 
is no need to use the strong reverse 
bias normally used in semiconductor 
detectors. 


See also section 11.2. 


The advantage of a measurement of 
the scintillation light is that the photon 
detector does not need to physically 
connect to the calorimeter, and thus 
does not influence its thermal proper- 
ties. 
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Figure 14.7: CRESST III detector [8]. 
Left: Schematic of a detector module 
(not to scale). Parts in blue are made 
of CaWO4, the TESs are sketched in 
red. Right: Neutron calibration data in 
the light yield versus energy plane. The 
bands are fits for P /y-events (blue), 
nuclear recoils off oxygen (red), and 
tungsten (green), where the respective 
lines correspond to the upper and lower 
90% boundaries of the respective band. 
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PARTICLE DETECTORS IN SPACE 


Particle detectors in space-borne experiments use the same detector tech- 
nologies as the earth-bound experiments we have discussed so far. How- 
ever, there are different integration and operational challenges: 


Detectors must withstand the stresses during launch; 


The power available to space-borne experiments is limited and the 
heat generated in the detector must be removed by a cooling sys- 
tem and radiated to space (which is at 2.7 K). Temperatures can vary 
significantly, depending on the amount of infrared radiation the ex- 
periment is exposed to (for example facing or opposing the earth). 


The experiment must not produce any stray magnetic field outside 
of the experimental volume; 


The data rate from the detector must be compatible with the band- 
width of the satellite’s downlink; 


Access for repairs and maintenance is impossible, or at least ex- 
tremely restricted. This also usually excludes replenishment of con- 
sumables and volatile components (gases or liquids). 


Exposure to neutral particles: In low earth orbits (LEO, between 
200 and 2000 km height) atomic oxygen can be a major source of 
erosion; 


Exposure to charged particles: At about 400 km (orbit of the ISS), 
about 1% of the atmosphere is ionised. This fraction increases to 
100% at geosynchronous altitudes (37,000 km). This plasma is 
strongly influenced by the earth magnetic field and local densities 
vary significantly, also with height. The plasma can charge up ele- 
ments of the spacecraft. 


The Van Allen belts are regions where energetic ions and elec- 

trons experience long-term magnetic trapping. The energy of these 

trapped particles is greater than 30 keV and can reach hundreds 

of MeV. The intensity of the trapped radiation flux can reach up 

to 108 — 10° cm~2s7! at a distance of about 2 Rg for electrons with Rz is the earth radius (about 6371 km). 
Ex > 0.5 MeV and at about 3 Rg for Eg > 0.1 MeV protons [119]. 


The total dose collected during a space flight is relatively small, 
typically in the order of 10 Gy(Si). The long term damage from ra- 
diation will thus be negligible. However, there can be Single Event 
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Figure 14.8: Earth’s magnetosphere in 
the noon-midnight plane [400]. The 
dashed lines are the original dipole 
field. The solid lines are magnetic fields 
modified by external currents. IMF 
stands for interplanetary magnetic field, 


Geomagnetic tail 
l; which is the component of the solar 
Solar wind Neutral sheet 3 á 
1 i magnetic field that is dragged out from 
0R the solar corona by the solar wind flow 


to fill the solar system. 


Magnetopause 


Plasmasphere 
d Van Allen belts 


Effects (SEE) caused by single, heavily ionising particles depositing 
large energy in the electronics (see section 3.9). 


SPACE-BORNE DETECTORS FOR CHARGED PARTICLES 


A class of space-borne particle physics experiments measures composi- 
tion, rates and energy spectra of the charged particle flux around earth, 
comprising a large range of ions. The central component of such an ex- 
periment is a magnetic spectrometer, typically instrumented with silicon 
detectors. An example for such an experiment is AMS-02 [30], which is 
part of the International Space Station (ISS). 

AMS-02 consists of several detector systems. The central part is a 
magnetic spectrometer. To allow for an extended operation (>3 y) of 
the experiment, without the need to refill cryogenic liquids, a permanent 
magnet is used. It consists of 64 sectors, each made of 100 high-grade 
NdFeB blocks, assembled in a cylindrical shell 800 mm high with an inner 
diameter of 1115 mm. The segments are oriented in a ‘magic ring’ con- 
figuration to produce a dipole field of 0.14 T perpendicular to the axis at 
the centre of the magnet, while minimising any stray field (3-4 1074 T 
anywhere at a distance larger than 2 m from the centre). The total weight 
of the magnet including the support structure is 2.2 t. 


Figure 14.9: ‘Magic ring’ configura- 
tion [119]. In this configuration the 
fields from the magnetic elements in the 
ring add up to a vertical dipole field 
across the ring with little stray field 
outside. 


Figure 14.10: The AMS-02 experiment. Left: Schematics of the experiment [462]. Right: Charges reconstructed in the tracker and the TOF sys- 
tem [30]. 
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Particle tracks are measured by nine layers of silicon detectors, which, 
after combination with the magnetic field map, yields the track rigidity 
R = pc/Q, where p is the momentum and Q the charge of the incoming 
particle. The silicon detectors do have analogue readout to allow for a 
measurement of the charge of the incoming particle (as the radius of the 
track in the spectrometer scales with Q~'), but to provide the dynamic 
range to accommodate the large range of energy deposition for different 
ions, the pre-amplifier has been given a highly non-linear characteristics. 
The position is reconstructed from the distribution of charges in the hit 
strips. The position resolution depends on the inclination angle and, due to 
the non-linear response, on the charge of the incoming particle [55]. 

A Transition Radiation Detector (TRD) is located at the top of 
AMS-02. Its main purpose is to identify electrons and positrons by tran- 
sition radiation while rejecting protons at a level of 107°. The TRD also 
provides an independent tracking capability, and determination of the 
charge value of the nuclei by measuring the rate of energy loss (dE/dx). 
It is made of 6 mm straw tubes with a Xe/COz2 gas mixture, interleaved 
with a polyethylene/polypropylene fibre fleece radiator. The leak rate of 
the detector gas is sufficiently low to achieve >30 y of operation with the 
on-board gas supply (~55 kg). 

Time-of-flight counters (TOF) consisting of two planes of scintilla- 
tion counters with PMT readout above and two planes below the magnet 
provide a charged particle trigger to AMS, determine the direction and ve- 
locity of incoming particles, and measure the charge via dE/dx. The time 
resolution of each counter is 160 ps for Z = 1 nuclei. The timing resolu- 
tion improves with increasing magnitude of the charge to a limit of 50 ps 
for Z > 6 nuclei. Anti-coincidence counter (ACC) surrounding the experi- 
ments veto events that could be contaminated by sideways going particles. 

Below the spectrometer is a proximity-focusing RICH, with NaF crys- 
tal (n = 1.33) and aerogel (n = 1.05) radiators, an expansion volume with 
a depth of 470 mm, and segmented PMTs for photon detection. 

The electromagnetic calorimeter [21] is made of 1 mm thick grooved 
lead foils interleaved with scintillating fibres with a lead/fibre/glue 
volume ratio of 1/0.57/0.15, read out with PMTs. The total thickness 
of the ECAL is 17 Xo. The resolution measured in test beams was 
Og /E = (10.4+0.2)%/,/E/GeV @ (1.4£0.1)%. 

The overall AMS-02 experiment measures 5 x 4 x 3 m3, and has a 
weight of 7.5 t. 


SPACE-BORNE GAMMA RAY DETECTORS 


An example for a space-born gamma ray telescope is the Fermi Gamma- 
Ray Space Telescope (FGST). It includes two instruments, the Large Area 
Telescope (LAT) [107], and the Gamma-ray Burst Monitor (GBM) [366]. 
It occupies an 565 km orbit with an inclination of 25.6 degrees. 

The LAT is an imaging gamma-ray detector that detects photons with 
energies from 20 MeV to 300 GeV, with a field of view of about 20% of 
the sky. High-energy y-rays cannot be optically focused. Thus, the LAT 
is a pair-conversion telescope, which consists of a precision converter- 
tracker and a calorimeter. The converter-tracker has 16 planes of high-Z 
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Figure 14.11: Position resolution in the 
AMS-02 silicon tracker as a function of 
the track inclination [55]. 
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material (tungsten), in which y-rays can convert to an ete" pair, inter- 
leaved with 36 planes of silicon strip detectors, and spaced by carbon-fibre 
composite structures. The first twelve planes of tungsten are each 2.7% Xo 
(0.095 mm) in thickness, while the final four are each 18% Xo (0.72 mm) 
in thickness. The complete stack of tungsten converts about 63% of y 
rays with an energy above 1 GeV (above 1 GeV the pair-conversion cross- 
section saturates, see section 2.1). The longitudinal separation is chosen 
to give good angular resolution. For photons with low energies, the hits 

in the first two layers following the conversion are the most relevant, as 
multiple scattering of the e*e~ pair degrades the measurement of their 
position and direction. Also, the detector layers are located immediately 
behind the tungsten foils, so that only the multiple scattering in the first 
absorber layer is relevant. The lateral segmentation into strips with a pitch 
of 228 um and the ratio of strip pitch to vertical spacing between tracker 
planes of 0.0071 has been chosen to achieve good direction reconstruction 
for high energy photons (>1 GeV). 

The FGST-LAT calorimeter consists of bars of CsI crystals, read out 
by p-i-n diodes. The total vertical depth of the calorimeter is 8.6 Xo (for a 
total instrument depth of 10.1 Xo). The depth is a compromise in shower 
containment against maximum permitted mass. The calorimeter is seg- 
mented into eight longitudinal segments and laterally into units with a 
width corresponding to one Moliére radius. The fine segmentation is used 
to improve the energy measurement at high energies using a shower pro- 
file analysis. 

An anticoincidence detector (ACD) made of plastic scintillator tiles 
read out with PMTs surrounds the experiment to reject charged particle 
backgrounds. 

The FGST-LAT dimensions are 1.8 x 1.8 x 0.72 m°. The power re- 
quired and the mass are 650 W and 2.8 t, respectively. 
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The primary role of the FGST-GBM is to complement the LAT 
with observations at lower energies (~8 keV to ~40 MeV). It consists 
of 14 scintillation detectors (twelve NalI(TI) crystals for the 8 keV to 
1 MeV range and two BGO crystals with sensitivity from ~200 keV to 
~40 MeV), and can detect gamma-ray bursts in that energy range across 
the whole of the sky not occluded by the Earth. 

The Nal(TI) crystal disks have a diameter of 5” and a thickness of 
0.5”. On the front face, there is a entrance window made of a 0.2 mm 
thick Be sheet and a 0.7 mm thick silicone layer, which defines the low 
energy threshold of 8 keV. The Nal(T) scintillators are distributed on the 
surface of the spacecraft pointing in different directions, so that the direc- 
tion of a gamma ray burst can be determined from the relative count rates. 
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Figure 14.12: Reconstruction of a 
470 MeV photon conversion in one of 
two FGST tracker towers operated 
together with calorimeter modules 
in a tagged-photon beam test at 
CERN [106]. 


Figure 14.13: FGST-LAT performance 
as a function of photon energy at 
normal incidence (solid curve) and for 
incoming particles at an angle of 60° 
off-axis (dashed curve) for 68% con- 
tainment [107]. Left: Angular resolution 
for conversions in the thin foil section 
of the tracker. Right: Energy resolution. 
The energy resolution is worse for very 
high energy because of shower leakage. 


The combination of LAT and GBM 
provides the possibility to measure 

y rays over seven orders of magnitude in 
energy. 
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The two BGO scintillators have a diameter and a length of 5”. They are 
positioned at opposite sides of the spacecraft so that any burst above the 
horizon will be visible to at least one of them. The scintillation light from 
the crystals is read out by PMTs, one each for the NalI(T1) crystals and 
two each for the BGO crystals, to improve light collection and to provide 
redundancy. 
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14.5 RELIABILITY 


Large detector systems with many channels often need to operate for a 
long period of time. In cases like the inner detectors of collider experi- 
ments access is almost impossible because of the convoluted nested geom- 
etry in these experiments and activation of the detector material. There- 
fore, reliability is a major concern. Reliability is even more critical in 
space missions in which it is (almost) impossible to carry out any main- 
tenance or repairs. Failures can occur at many levels, electrically or me- 
chanically. 

A key strategy to achieve reliability is rigorous testing of all compo- 
nents during design, prototyping and construction of a detector. In gen- 
eral, at the development level components should be exposed to conditions 
more extreme than expected in operation to identify failure modes and if 
possible make changes to improve reliability. Typical stresses for elec- 
tronics include temperature, and voltage or current. For HEP applications 
radiation damage is usually an important stress. These tests should also 
be repeated on a batch basis during production of the final components to 
check for variations in the production processes. 

A complementary methodology, which aims to predict long-term per- 
formance of components, is Highly Accelerated Stress Test (HAST) in 
which the Mean Time To Failure (MTTF) is measured as a function of ap- 
propriate stress variables. Phenomenological models are then used to fit 
the data. The models can then be used to predict the lifetime at the oper- 
ating conditions. A simple example of this approach are reliability studies 
of vertical cavity surface emitting lasers (VCSELs) used for data transmis- 
sion from the detector, for which the MTTF scales with temperature T and 
current J according to the Arrhenius equation [472] 


él exp (—Ea /kpT?) 
h exp (—Ea/kp7\)’ 


MTTF(,71) _ 


(14.2) 
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Figure 14.14: FGST-GBM [366]. Left: 
Mounting of crystals on spacecraft 
(0-11 Nal(Tl), 12-13 BGO). Right: En- 
ergy resolution for the two types of 
scintillators. 


Much of the methodology of relia- 
bility engineering was developed by 
NASA [493]. 


Performance verification during the 
design phase is referred to as ‘qual- 
ity assurance’ (QA), and during the 
construction as ‘quality control’ (QC). 


This methodology is called Highly 
Accelerated Lifetime Tests (HALT). 


Testing of final production components 
at elevated stress levels is referred to 
as Highly Accelerated Stress Screen 
(HASS). 
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where EF’, is the activation energy. The value of Ea can be determined 
from a fit to accelerated ageing tests at different currents and temperatures. 
A similar approach is used for other devices, in which accelerated ageing 
data is used to fit different phenomenological models, thus allowing an 
extrapolation of MTTF to the operating conditions. 

Another strategy to improve reliability is the use of ‘redundancy’ in 
which if one component fails then another one can perform the same func- 
tion. Redundancy should always be used if human safety is involved or if a 
single point failure would be ‘mission critical’. However, it is usually im- 
practical to use redundancy for all channels of a large detector. Additional 
detector layers (e.g. in a tracking detector) can, in addition to sometimes 
improving system performance (as long as the additional material has no 
detrimental effects), also provide some measure of redundancy. Redun- 
dancy does not guarantee reliability, for example it does not achieve relia- 
bility if components or sub-system lifetimes are shorter than the required 
operation time of the detector. 

Newer Deep Sub-Micron (DSM) chip technologies have provided dra- 
matic improvements in performance but at the cost of decreased reliability. 
One of the failure modes is due to electromigration [493]. The decrease in 
transistor feature size results in a decrease in the height and cross-section 
of the conductors. Therefore, there can be very large electron current den- 
sities which can ionise the atoms which then move in the electric field. 
This can result in voids in the conductor of the tracks connecting the com- 
ponents in the chip. These decrease the effective area of the conductor and 
can lead to catastrophic thermal failure due to high local current densities. 

In general, the reliability of electronics can be improved by decreasing 
the operating currents/voltages and running at lower temperature. How- 
ever, this might conflict with the requirements for channel density, noise 
performance etc. 

In addition, all components and sub-systems need to be tested dur- 
ing and after production to check that they meet the specifications before 
being integrated into larger structures. Components can also be operated 
before installation for longer periods of time than required to validate the 
performance in an attempt to weed out ‘infant mortality’. 

Other failure modes of electronic systems include cracks in thin con- 
ductors and connector failures. The desire to minimise material results in 
thinner, more fragile conductors and very dense small form factor connec- 
tors, which can be less robust. 

A typical mechanical failure would be failure of a glue joint, or fail- 
ure of a part due to unexpected loads, either due to the magnitude or the 
frequency of high-load events, or because of a manufacturing flaw. Sim- 
ilarly to electrical systems, mechanical properties also need to be under- 
stood and appropriate procedures need to be developed. Mechanical de- 
signs can also be guided by Finite Element Analysis (FEA). International 
engineering standards are a good guidance to help design systems with 
comfortable safety margins, although the requirements of particle physics 
experiments (for example the minimisation of scattering material) can 
sometimes justify a more aggressive approach. In this case extensive pro- 
totyping is required to validate the approach. 
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A typical value would be E, ~ 1 eV. 


Some small failure rates always have 
to be accepted and considered in the 
design of the experiment. 


Other failure modes are Time- 
Dependent Dielectric Breakdown, Hot 
Carrier Injection (HCI) and Negative 
Bias Temperature Instability [493]. 


At low temperatures the carrier mean 
free paths become longer and therefore 
a ‘hot’ carrier (i.e. a carrier with an 
energy larger than average) can acquire 
sufficient energy in an electric field to 
reach the gate oxide of a CMOS tran- 
sistor. The build-up of charge can result 
in the transistor failing. This mecha- 
nism is called ‘hot carrier injection’ 
(HCI) [493]. HCI is more severe at 
lower temperatures and is thus a ma- 
jor issue for electronics in cryogenic 
detectors. 


Tonising radiation can break chemical 
bonds and therefore reduce the strength 
of a glue. 


This involves a combination of testing 
prototypes to a higher stress than ex- 
pected during operation (and repeating 
these tests on a batch basis) and testing 
every part when this is practical. 


No compromises can be made if per- 
sonal safety is involved. 
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Detectors are usually assembled at room temperature and therefore, if 
they are operated at a different temperature, there will be thermal stresses. 
Hence, performance should be verified under thermal cycling during de- 
velopment and in production. 

Space missions have to survive very high acceleration during launch, 
therefore all components must pass rigorous vibration testing. 
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Key lessons from this chapter 


Particle physics experiments usually rely on the whole range of detector techniques 
discussed in this book, to achieve mutually supporting, complementary and/or redun- 
dant measurements. 


The integration of these different detectors involves a challenging optimisation to 
achieve low material, coverage, compactness and low cost, requiring careful optimisa- 
tion and compromises. 


LHC detectors are probably the most complex pieces of scientific equipment ever 
built. Very extreme challenges in terms of data rates and radiation damage needed to 
be understood. 


Neutrino detectors require very large fiducial mass, and the challenge is to develop 
powerful yet affordable detectors. 


Rare decay experiments have extreme requirements for controlling backgrounds like 
radioactivity. 


Modern space experiments like AMS-02 and FGST are similar in many ways to large 
terrestrial high energy physics detectors, with added challenges due to launch forces, 
environmental conditions, and the impossibility to service the detector. 


Detectors in large and complex experiments have very demanding reliability require- 
ments. 


EXERCISES 


1. Solenoidal magnetic fields and muon triggers. 


Consider a cylindrical detector with a solenoid magnetic field of magnitude B. The flux of the 


solenoid is returned in a yoke made of iron. The solenoid has a radius R; and the flux return ex- 
tends from R; to a radius of Ro, 


a) 
b) 


c) 


d) 


Explaining any assumptions you make, show that ie > B(r)dr =0. 


2. A rare decay search experiment. 


An experiment is to be designed to look for the decay p > u*7°. 


Consider a charged particle created in a collision on the solenoid axis moving in the plane 
perpendicular to the B field. Write down an expression for the force on the particle and deter- 
mine the resultant torque. 


What is the net change in angular momentum of the charged particle from creation to when it 
reaches a radius R2? 


What are the implications for a possible muon trigger system based purely on the measure- 
ment of the muon trajectory outside the return yoke for this geometry? 


a) Assuming that the partial decay rate for this decay mode is 1/(10*” y), make a rough estimate 


of the mass of protons required in order to have a measurable counting rate. 


306 Detectors in Particle Physics 


b) What would be the experimental signature of such a decay in a detector based on the 
Cherenkov effect? Suggest a suitable choice for the Cherenkov material, and give a basic 
outline of the overall detector. What sources of background will need to be considered? How 
could these backgrounds be reduced? 


3. Powering. 


A power supply system is required to deliver a current Jp at a voltage of Vo (between input and re- 
turn lines) for N modules. The resistance of the cables carrying the current from the power supply 
to the modules is R. 


Calculate the power dissipated in the cable(s) and the voltage drop along the cable(s) for the four 
cases below, explaining any approximations you use. Why do we need to minimise the power dis- 
sipation in the cables and the voltage drop along the cables? What are the advantages and disad- 
vantages of each approach? 


a) Individual module powering. Each module is powered by an independent power supply and 
cable. 


b) Parallel powering. All modules are powered by one power supply and one cable. 


c) Serial powering. One power supply is used to feed the current required by one module 
through one cable to the first module and all N modules are connected in series. 


d) DC-DC Converters. A power supply outputs a voltage of NVo and is connected in parallel to 
all N modules. A DC-DC converter on each module transforms the voltage from NVo to Vo. 
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